* 


OTIC  FILE  COPJ 


j 


i 

a 


SEP  1 1 1980 


PRODUCTIVITY:  EFFECTS  OF  INFORMATION 
FEEDBACK  ON  HUMAN  ERRORS 


H.  Mcllvaine  Parsons 


DTIO 


I£LECTE 

AUG  0  3  1987 


i 


FINAL  REPORT 
July,  1980 


Preparation  of  this  report  was  supported  by  the 
Engineering  Psychology  Program,  Office  of  Naval  Research 

t 

ONR  Contract  N00014-77-C-0024  Work  Unit  NR197-044 


Approved  for  public  release;  distribution  unlimited 


Reproduction  in  whole  or  in  part  is  permitted  for  any  purpose 
of  the  United  States  government 


t>  V 


REPORT  DOCUMENTATION  PAGE 


T7.r:T:H* 


I.  REPORT  NU 

Final  Report 


4.  TITLE  (and  Subtitle) 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 
2.  GOVT  ACCESSION  NO.  I  3.  RECIPIENT'S  CATALOG  NUMBER 


Productivity:  Effects  of  Information 
Feedback  on  Human  Errors 


5.  TYPE  OF  REPORT  &  PERIOD  COVERED 

Final  Report 

1  Oct.  1977  -  31  Jan.  1979 

6.  PERFORMING  ORG  REPORT  NUMBER 


7.  AUTHOR (s) 


H.  Mcllvaine  Parsons 


8  CONTRACT  OR  GRANT  NuMBERlj) 


N00014-77-C-0024 


10  PROGRAM  ELEMENT  PROJECT  TASK 
AREA  g,  WORK  UNIT  NUMBERS 


NR  197-044 


13.  NUMBER  OF  PAGES 


9  PERFORMING  ORGANIZATION  NAME  AND  ADORESS 

Institute  for  Behavioral  Research 

2429  Linden  Lane,  Silver  Spring,  MD  20910 


1 1 .  CONTROLLING  OFFICE  NAME  AND  ADDRESS 

Engineering  Psychology  Program 
Office  of  Naval  Research  -  Code  455 
Arlington,  VA  22217 

74.  MONITORING  AGENCY  NAME  &  AODRESS  lit  different  from  Controlling  OMicel  i  IS  SECURITY  CLASS,  to*  this  reponl 

|  UNCLASSIFIED 


16  DISTRIBUTION  STATEMENT  lot  th.i  Report) 


IS  Dt  CLASSIFICATION, DOWNGRADING 
SCHEDULE 


Approved  for  public  release;  distribution  unlimited 


17.  DISTRIBUTION  STATEMENT  lot  the  abctroct  entered  in  Block  20  ildilfe'ent  Irom  Reportl 


18  SUPPLEMENTARY  NOTES 


19  KEY  WORDS  (Continue  on  reverie  side  it  necenary  and  identity  by  b'rvii  mmtipr  I 

Copying 

Information  Feedback  Detection 

Human  Errors  Coding 

*  Productivity  Reinforcement 

Information  Processing  Consequat i on  . 


20  ABSTRACT  (Continue  on  reverse  side  it  necessary  and  identity  bv  t><  k  h  numb**' 

This  report  describes  a  research  program  exploring  the  effects  oi  verbal 
conscquators  (information  feedback  or  re i n force rs/pun i sher> )  on  human 
errors  in  copying,  detection,  and  cod  ini*  tasks.  The  program  undertook 
three  experimental  studies,  a  study  of  consequat or  ratings,  and  a  mapping 
of  consequat ion  variables,  but  failed  to  produce  evidence  of  dittcrcntial 
effects,  though  much  was  learned  about  the  methodology  of  inducing  errors 
experimentally.  The  ratings  study  has  implications  tor  job  pert ormance . 


__  FORM 

DO  ’  Jon  73  1  473 


SECURITY  Cl  ASSIUCA’UIN  Of  TinSPAi.l  (  Whrn  Data  t  nterrd  i 


TABLE  OF  CONTENTS 


Introduction 

Copying  Study 

Consequator  Rating  Study 

Mapping  of  Consequation  Variables 

Detection  Study 

Coding  Study 

References 


Pages 

1 

3 

16 

26 

27 

33 

36 


Acce " 

N'TIS  Ch 
D’lC  TA 

U.  ..mviiii. 

J . t > f  1  , 

By 

L)  •  t  .• 


A-/ 


INTRODUCTION 


In  the  growing  concern  about  productivity  in  both  military  and  civilian 
domains,  error  reduction  has  been  overshadowed  by  interest  in  output  and  at¬ 
tendance,  in  productivity  research.  The  project  described  in  this  final  report 
on  Contract  N00014-78-C-0024  investigated  error  reduction  in  productivity- 
related  human  information  processing  tasks.  Such  tasks  can  be  critical  to 
all  kinds  of  naval  operations  and  maintenance.  The  tasks  initially  selected 
for  examination  in  this  project  were  copying,  detecting,  coding,  and  categoriz¬ 
ing;  the  last  was  dropped.  Considerable  experimental  effort  was  devoted  to 
the  others,  though  it  must  be  viewed  as  exploratory  rather  than  definitive. 
Outside  the  psychomotor  and  sensory  fields,  relatively  little  experimental 
research  has  been  directed  explicitly  at  variables  influencing  human  error  in 
repetitive  information  processing  performance  (rather  than  learning).  Such  is 
especially  the  case  for  within-subject  designs.  Whenever  a  new  domain  is 
studied,  more  exploration  may  be  needed  than  was  anticipated,  as  in  this  proj¬ 
ect  . 


The  category  of  independent  variables  of  interest  can  be  labelled  as  "con- 
sequation,"  which  may  be  interpreted  to  include  both  information  feedback  and 
reinforcement.  Both  of  these  terms  mean  that  an  event  that  follows  some  per¬ 
formance  may  influence  its  future  occurrence.  A  very  large  number  of  variables 
are  involved  in  the  effects  of  consequation,  as  a  technical  report  published 
as  part  of  this  project  attests  (Parsons,  1979).  The  form  of  consequation  ex¬ 
amined  in  the  project  was  verbal,  in  particular  verbal  feedback  that  might  be 
viewed  as  either  informational,  or  derogatory/complimentary,  or  both.  Re¬ 
markably  little  investigation  has  been  accorded  aversive  verbal  consequation 
(other  than  right-wrong  studies  of  knowledge  of  results),  though  unfavorable 
comments  are  common  events  in  organizational,  family,  and  other  situations. 

From  a  theoretical  viewpoint,  the  project  reflected  an  interest  in  re¬ 
lating  to  each  other  the  cognitive  area  of  information  feedback  (or  knowledge 
of  results)  and  the  behavioral  area  of  reinforcement  (and  punishment)  in 
operant  conditioning.  The  information  an  individual  receives  after  and  about 
performance  and  some  hedonic  consequence,  reinforcing  or  punishing,  may  oc¬ 
cur  together  either  from  separate  circumstances  or  in  the  same  verbal  comment 
from  another  person.  Some  cognition-oriented  analysts  (e.g.,  Annett,  1909) 
have  preferred  to  attribute  all  effects  of  consequation  to  its  informational 
content.  Rehaviorists  tend  to  give  the  entire  credit  to  motivational  vari¬ 
ables,  reinforcers  or  punishers,  even  though  their  views  of  behavior  include 
discriminative  stimuli  as  behavioral  outcomes  (and  discriminative  stimuli, 
or  "discriminators,"  can  be  considered  "information").  In  reviews  of  the 
Hawthorne  studies  (Parsons,  1 9 ”4 ,  1978),  the  principal  investigator  in  this 
project  concluded  that  t  lie  Hawthorne  effect  resulted  from  a  combination  of 
information  feedback  (discriminator  consequation)  and  monetary  reward  (re¬ 
inforcement  conseipiat i on )  contingent  on  performance  that  increased  produc¬ 
tivity.  His  Hawthorne  analyses  were  responsible  for  proposing  the  present 
project,  though  it  was  directed  at  error  reduction  rather  than  an  increase 
in  output. 
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Both  informational  and  motivational  variables  were  included  in  the  tech¬ 
nical  report  already  mentioned  and  described  briefly  in  this  final  report  as 
Mapping  of  Consequation  Variables,  and  they  are  implied  in  the  section  on 
Consequator  Rating  Study,  an  inquiry  into  self-reports  about  such  variables. 
Relationships  between  the  two  types  of  variables  constituted  the  underlying 
reason  for  the  three  experimental  studies  undertaken  and  reported,  though  the 
two  studies  which  actually  manipulated  consequation  as  a  variable  must  be  re¬ 
garded  as  only  a  preliminary  and  exploratory  step  toward  examining  such  rela¬ 
tionships.  The  studies  are  described  under  the  headings  Copying,  Detection, 
and  Coding,  the  three  information  processing  tasks  mentioned  earlier. 

The  copying  study  failed  to  produce  evidence  that  information  feedback 
reduced  the  frequency  of  errors  or  that  differences  in  the  nature  of  the 
feedback  affected  error  frequency.  Following  a  substantial  exploratory  phase, 
data  were  obtained  from  only  two  subjects  in  the  detection  study  before  this 
report  was  submitted,  and  these  data  had  not  been  processed.  The  coding  study 
also  had  many  exploratory  sessions  to  explore  methods  of  inducing  error  but 
never  reached  the  point  of  examining  different  types  of  information  feedback. 
Thus,  much  was  learned  about  methodology  for  inducing  errors  in  human  informa¬ 
tion  processing  but  the  project  failed  to  demonstrate  differential  effects  from 
information  feedback. 

Among  the  lessons  learned  was  the  difficulty  of  inducing  error  at  levels 
that  would  make  it  possible,  without  an  exorbitant  number  of  trials,  to  deter¬ 
mine  whether  differences  in  results  from  experimental  conditions  were  conse¬ 
quential  and  significant.  Due  probably  to  the  demand  characteristics  of  the 
experimental  arrangements,  subjects  tended  to  try  to  keep  from  making  errors 
much  as  they  had  done  in  tests  at  school.  As  a  result,  the  tasks  had  to  be 
made  more  difficult  or  the  trials  and  sessions  longer  or  more  numerous  than 
otherwise  might  have  been  necessary,  especially  in  a  within-subjects  design. 
Though  more  subjects  would  be  needed  and  the  costs  would  be  greater,  a  between- 
subjects  design  may  be  viewed  as  preferable  for  this  sort  of  investigation. 

During  the  course  of  this  project  but  not  as  an  integral  part  of  it,  the 
principal  investigator  gave  an  invited  paper  at  the  Symposium  on  Productivity 
Enhancement:  Personnel  Performance  Assessment  in  Navy  Systems,  of  the  Navy 
Personnel  Research  and  Development  Center,  San  Diego  (Parsons,  l!)"). 

Appreciation  is  extended  to  the  project's  Scientific  Officer,  Dr.  Martin 
Tolcott,  Director,  Engineering  Psychology  Program,  for  his  patience  with  un¬ 
avoidable  problems  that  impeded  the  project's  timely  completion,  and  to  Mr. 
Kenneth  R.  Williams,  Research  Assistant  at  the  Institute  for  behavioral  Research, 
whose  collaboration  in  collecting  data  and  in  other  aspects  of  the  project  was 
conscientious  and  invaluable. 

All  participants  in  the  research  signed  informed  consent  forms,  which, 
together  with  the  procedures,  were  approved  by  the  Committee  for  the  Ptotci - 
tion  of  Humans  in  Research,  the  inst i tut iona 1  review  board  oi  tin  Institute 
for  Behavioral  Research. 
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COPYING  STUDY 


A  familiar  human  information  processing  task  is  copying  or  transcribing, 
either  between  input  modalities  or  within  a  modality.  In  the  latter  in¬ 
stance  an  individual  may  read  from  one  kind  of  visual  presentation  and  copy 
the  material,  without  recording  it,  on  another  kind  of  visual  display.  It  is 
common  experience  that  errors  occur  in  copying.  A  study  was  conducted  to  in¬ 
vestigate  the  effects  of  several  kinds  of  consequation ,  or  information  feed¬ 
back,  on  copying  errors. 

The  experimental  strategy  was  to  use  a  within-subjects  design  and  have 
a  small  number  of  participants  (subjects).  No  large  pool  of  relatively 
homogeneous  individuals  was  readily  available  for  selecting  randomly  a  sub¬ 
stantial  number  of  participants  for  different  treatment  conditions.  Since 
with  a  within-subjects  design  it  is  essential  to  minimize  learning  that 
could  confound  the  effects  of  the  states  of  the  independent  variable(s),  it 
seemed  necessary  to  select  material  to  copy  that  already  was  well  learned, 
that  is,  was  well  established  in  the  subject's  repertoire.  It  was  also 
advisable  to  schedule  a  large  number  of  trials  prior  to  the  data-taking  trials. 

Since  most  Americans  who  have  at  least  graduated  from  high  school  have 
extensively  discriminated  and  manipulated  numbers,  learning  of  individual 
numerals  will  not  occur  during  an  experiment.  (The  participants  in  this 
study  responded  to  advertisements  posted  at  a  nearby  junior  college,  although 
not  all  were  students.)  Numbers  can  be  selected  and  arranged  sequentially  by 
means  of  random  number  tables  so  that  sequences  and  arrays  will  be  different 
but  equivalent.  Their  differences  should  obviate  learning  in  the  form  of 
memorization  of  sequences  or  arrays  in  repeated  presentations  during  the  study, 
and  their  equivalence  should  prevent  confounding  due  to  differences  in  the 
stimulus  material.  For  these  reasons  rows  of  random  numbers  consitutcd  the 
material  to  be  copied. 


Participants 


Ten  participants  (college  students  or  their  acquaintances)  performed  in 
the  experiment  proper,  six  men  and  four  women.  Toy  were  told  that  the  study 
involved  clerical  tasks  but  were  not  informed  about  the  information  feedback 
conditions.  They  signed  consent  forms  approved  by  the  Committee  for  Protec¬ 
tion  of  Humans  in  Research  (institutional  review  board)  of  the  Institute  for 
Behavioral  Research,  which  approved  the  procedures  in  the  study.  Seven  other 
participants  took  part  in  an  earlier  exploration  of  procedures. 

P  roc  t*diJTe_s 

l  ach  participant  performed  m  a  three -hour  session,  .uu!  -a  v  ot  them  per¬ 
forated  in  a  similar  session  .i\  to  II  Javs  earlier.  I  ach  of  the  other  tour 
had  taken  part  in  a  session  .even  to  ..'a  days  earlier  wlun  the\  carried  out  a 
siinil.ii  number  copying  task  in  t  he  procedural  cxplorition. 
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Each  participant  encountered  five  conditions.  The  first  and  last  were 
baseline  conditions,  without  feedback.  In  the  others,  information  feedback 
took  three  forms:  "You  got  most  of  them  right,"  "You  got  some  of  them  wrong," 
and  "You  got  some  of  them  wrong.  Mistakes  with  numbers  can  be  most  unfor¬ 
tunate  in  some  clerical  tasks."  There  were  five  orders  of  encountering  these 
conditions  among  the  ten  participants. 

In  each  condition,  a  participant  had  to  copy,  on  a  typewriter,  random 
numbers  that  were  projected  on  a  white  surface  from  a  transparency.  The 
numbers  were  arranged  in  columns  or  blocks  of  ten  rows  (two  sets  of  five) 
each.  For  seven  of  the  participants  there  were  eight  numerals  per  row,  for 
the  other  three  nine,  except  in  the  first  two  baseline  blocks,  which  had  ten. 
Accuracy  performance  on  these  two  was  examined  to  determine  whether  a  parti¬ 
cipant  should  copy  eight  or  nine  numerals  in  a  row,  nine  being  more  difficult. 
Each  condition  had  five  blocks  or  columns  of  numerals  with  eight  or  nine  per 
row.  Thus,  a  participant  copied  400  or  450  numerals  per  condition  (plus  the 
additional  two  blocks  in  the  initial  baseline).  Only  one  row  was  displayed 
at  a  time,  for  five  seconds.  The  participant  had  to  wait  ten  seconds  before 
starting  to  type  (after  receiving  a  "Go").  A  participant  could  pace  himself/ 
herself  in  typing  but  was  given  the  visual  instruction,  "Type  the  numbers  as 
fast  as  you  can,"  at  the  start  of  each  experimental  condition.  Participants 
were  not  permitted  to  make  corrections.  A  new  row  of  numerals  was  displayed 
when  the  participant  pressed  the  carriage  return.  The  participant  sat  on  the 
other  side  of  a  partition  from  the  investigator,  facing  the  projected  display. 

A  lamp  illuminated  the  typewriter,  and  overall  illumination  was  kept  constant. 

There  were  two-minute  intervals  between  blocks/columns  and  five  minutes 
between  conditions.  In  the  feedback  condition,  a  feedback  message  was  pro¬ 
jected  for  30  seconds  after  each  block  of  numerals,  beginning  60  seconds  after 
the  start  of  the  interval  (to  make  it  appear  plausible  that  the  investigator 
had  actually  counted  the  number  of  errors).  The  time  to  complete  each  block 
was  recorded  but  the  errors  were  not  counted  until  after  the  session  (except 
for  the  two  initial  baseline  blocks).  Though  the  feedback  messages  were  not 
adjusted  to  actual  performance,  "most  right"  and  "some  wrong"  did  match  that 
performance. 

Within  a  session,  the  rows  of  random  numbers  in  any  block  were  unique; 
there  was  no  repetition  of  rows.  Numbers  were  drawn  from  a  random  number 
table.  All  participants  encountered  the  same  blocks  but  these  were  assigned 
differently  among  conditions  for  different  subjects.  Virtually  all  of  the 
blocks  in  the  principal  session  had  been  used  in  the  prior  session,  but  with  a 
different  assignment  to  conditions  for  the  same  participant. 

At  the  end  of  the  principal  session,  each  participant  answered  eight 
questions  about  the  study.  These  included  rating  the  feedback  conditions  on  a  7- 
point  unpleasant -pleasant  scale. 

Procedural  F.xplorat  ion 

For  two  reasons  it  appeared  essential  to  have  a  procedure  that  would  ac¬ 
tually  result  in  some  errors  consistently  in  each  participant.  One  reason 
was  the  content  of  feedback.  It  seemed  unwise  to  tell  a  participant  "ton 
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got  some  wrong"  if  the  participant  believed  this  was  unlikely.  In  copying 
tasks,  participants  can  self-detect  about  50%  of  their  errors  (Bailey,  1978). 
Second,  if  a  participant  made  no  errors  or  almost  none,  it  would  not  be  pos¬ 
sible  to  demonstrate  any  effects  of  variation  in  the  independent  variable. 

Much  effort  was  expended  in  ten  three-hour  procedural  exploration  sessions 
developing  a  task  that  would  produce  error  rates  between  1  and  18  percent. 

The  average  achieved  was  about  9  percent.  It  was  felt  that  too  high  a  rate 
would  be  as  insensitive  as  one  too  low.  There  was  a  considerable  range  of 
inter-subject  differences. 

The  task  finally  selected  obviously  exploited  the  limits  of  short-term 
memory.  The  use  of  random  digits  in  measuring  "immediate  memory  span" 
(Woodworth  and  Schlosberg,  1954,  p.  696)  is  hardly  novel.  Errors  increase 
sharply  between  8  and  9  digits  (Woodworth  and  Schlosberg,  1954,  p.  705). 
However,  in  earlier  experimental  work  and  in  intelligence  testing,  apparently 
individuals  started  to  repeat  the  series  as  soon  as  its  oral  or  visual  pre¬ 
sentation  ended  rather  than  after  some  fixed  interval,  as  in  the  present 
study.  More  recent  research  in  human  information  processing  has  exploited 
strings  of  random  digits  to  examine  the  effects  of  rehearsal  prevention-- 
the  Brown  Peterson  technique  (Baddeley,  1976)  and  of  rate  of  presentation 
(summarized  in  Kintsch,  1969),  and  instead  of  recall  has  examined  recognition 
(e.g.,  Shepard  and  Teghtsoonian,  1961)  and  a  "missing  scan"  procedure  where 
the  subject  must  say  which  digit  is  missing  in  a  series. 

Additional  Methods.  Other  methods  explored  to  produce  appropriate  error 
rates  included  the  following: 

1.  The  presentation  time  of  a  row  of  digits  was  shortened  to  3  seconds. 
As  with  Bailey  (1978)  and  in  some  cases  of  increasing  the  digit  presenta¬ 
tion  rate  (equivalent  to  shortening  the  row  presentation  time) ,  a  sub¬ 
stantial  number  of  errors  could  be  assured  through  briefer  displays. 
However,  this  method  was  rejected  because  (1)  presentation  time  was  mea¬ 
sured  by  stopwatch  and  was  imprecise  and  (2)  participants  too  often  om- 
mitted  a  row  entirely  or  fabricated  one  out  of  thin  air. 

2.  The  delay  interval  was  shorter  (5  seconds)  or  longer  (15  seconds). 

Not  enough  errors  occurred  with  the  shorter  interval ,  and  too  many  with 
the  longer. 

3.  With  continuous  rather  than  row-by-row  display  and  copying  of  a  col¬ 
umn,  defocussing  the  transparency  projector  made  the  displayed  digits 
more  difficult  to  discriminate.  Though  this  technique  was  effective 

in  producing  errors  (as  might  be  expected),  it  seemed  likely  that  partic¬ 
ipants  might  improve  greatly  with  practice  (even  without  item-by-item 
feedback),  and  in  any  case  the  projector  could  not  be  adjusted  to  pro¬ 
duce  equal  blurring  in  all  rows.  It  would  be  better  to  show  slides  of 
digits  that  had  been  prepared  with  uniform  blurring. 

4.  With  continuous  rather  than  row-by-row  display,  the  typewriter  was 
placed  at  a  90°  angle  to  the  display  so  the  participant  had  to  shift 
vision  that  amount  from  examining  the  display  to  seeing  the  keys  and 
typing  paper.  (Incidentally,  all  participants  were  required  to  type 
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with  a  forefinger  so  prior  practice  in  touch  typing  could  be  ruled  out.) 
Because  participants  rehearsed  the  digits  as  they  shifted  their  gaze 
and  the  interval  between  looking  and  typing  was  short,  this  technique 
failed  to  produce  errors. 

5.  With  continuous  rather  than  row-by-row  display,  the  number  of 
digits  in  a  row  was  varied.  Error  rates  were  very  low  with  5,  6,  and 
even  7  digits,  and  in  some  participants,  due  to  grouping  and  rehearsing, 
they  remained  low  even  with  8,  9,  and  10  digits.  Individual  differ¬ 
ences  were  considerable,  with  a  biochemistry  technician  and  a  super¬ 
market  checkout  clerk  doing  especially  well. 

6.  With  continuous  rather  than  row-by-row  display,  the  investigator 
uttered  random  numbers  while  the  participant  was  copying  the  column. 
There  was  no  effect. 

In  addition,  the  earlier  sessions  led  to  a  number  of  other  changes: 

1.  The  feedback  display  for  augmented  "wrong"  was  changed  from  "You 
got  some  of  them  wrong.  Mistakes  are  pretty  stupid  in  a  task  like  this, 
after  two  participants  laughed  at  it  and  said  it  was  bizarre. 

2.  Instructions  were  changed  from  "Copy  the  numbers  as  fast  as  you  can" 
to  "Type  the  numbers  as  fast  as  you  can"  after  participants  reported 
that  instructions  to  "copy"  meant  they  should  not  make  any  mistakes. 
"Copy"  implicitly  conveys  stimulus  control.  What  else  is  there  to  type 
except  what  is  displayed? 

3.  One  experimental  condition,  in  which  instructions  included  "Try 
not  to  make  mistakes,"  was  dropped  because  (1)  there  was  insufficient 
time  to  include  it  in  a  three-hour  session,  (2)  avoiding  mistakes  seemed 
to  be  implicit  in  a  copying  task  even  when  the  word  "copy"  was  changed 
to  "type,"  and  (3)  it  involved  another  variable  than  feedback.  It  could 
be  introduced  in  subsequent  research  to  test  the  effects  of  instructions 
combined  with  feedback. 

Results 


Criteria.  A  set  of  criteria  for  determining  whether  a  copying  error  had 
occurred  was  established  prior  to  the  first  data-taking  session.  The  cri¬ 
teria  and  the  categories  of  error  to  which  they  apply  arc  described  in  Table 
It  should  be  apparent  that  errors  take  diverse  forms,  even  in  copying  random 
numbers  (especially  when  these  occur  in  some  sequence,  such  as  a  row).  Pre¬ 
sumably  the  different  types  of  errors  have  differing  origins--an  interesting 
domain  for  research. 

Because  the  effects  of  any  information  feedback  presumably  would  not  de¬ 
velop  until  there  is  subsequent  performance,  error  data  were  taken  only  from 
the  last  four  blocks  within  a  condition  (including  the  baseline  conditions). 

Speed  of  Performance.  As  already  noted,  participants  were  told  to  type 
as  fast  as  possible.  That  was  their  only  opportunity  for  self-pacing.  They 
could  not  go  back  and  make  corrections,  and  both  display  time  and  delay  time 
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Table  1.  Error  criteria  in  the  Copying  study. 


1.  A  typed  digit  that  differs  from  the  projected  digit  in  the  same  row  position 
is  an  (confusion)  error.  Example:  27644310  typed,  27544310  projected. 

2.  If  a  digit  at  the  end  of  an  incomplete  typed  row  matches  the  digit  at  the  end 
of  the  projected  row  but  not  the  digit  in  the  same  position  in  the  projected 
row,  it  is  not  an  (confusion)  error,  whether  or  not  it  is  part  of  a  sequence. 
Example:  401936^  typed,  40193476_  projected. 

3.  Not  typing  a  projected  digit  is  an  (omission)  error,  due  perhaps  to  an  (antic¬ 
ipation)  error.  Example:  9824336  typed,  98254336  projected. 

4.  A  typed  digit  is  an  (intrusion,  addition,  or  perseveration)  error  when  it  oc¬ 
curs  before  the  end  of  the  ro w,  if  it  does  not  match  the  projected  digit  in 
the  same  row  position  and  precedes  a  typed  sequence  that  matches  a  projected 
sequence.  (However,  it  does  not  count  as  an  error  if  there  has  been  an  omis¬ 
sion  error  earlier  in  the  row.)  Example:  7276237  typed,  72723701  projected. 

5.  A  typed  digit  is  an  (addition  or  perseveration)  error  when  it  occurs  at  the 
end  of  the  row  and  causes  the  total  number  of  digits  in  the  row  to  exceed  the 
total  in  the  projected  row,  unless  it  is  part  of  a  projected  sequence. 

Example:  077352809_  typed,  07735280  projected^.  (NOT  077395280^  typed,  07735280 
projected. 

6.  A  typed  sequence  of  two  or  more  digits  matching  a  projected  sequence  is  NOT 
considered  an  (location)  error  when  the  location  of  the  sequence  in  the  typed 
row  differs  from  the  location  in  the  projected  ro w,  except  where  two  cwo- 
digit  sequences  are  reversed  (see  below) .  Examples:  81127813  typed,  08278136 
projected  or  81162781  projected. 

7.  Typing  two  projected  digits  in  reversed  order  counts  as  one  (transposition) 
error  and  is  regarded  otherwise  as  a  sequence.  Example:  74465901,  74456901 
projected. 

8.  Typing  three  projected  digits  in  some  kind  of  reversed  order  counts  as  one 
(transposition)  error  and  is  regarded  otherwise  as  a  sequence.  Example:  365 
typed,  663  projected;  058  typed,  806  projected. 

9.  Typing  three  projected  digits  of  which  two  are  the  same  so  the  single  digit 
is  doubled  counts  as  two  (exchange)  errors  and  is  regarded  otherwise  as  a 
sequence.  Examples:  363  typed,  636  projected;  336^  typed,  663  projected. 

10.  Typing  four  projected  digits  in  which  two  two-digit  sequences  arc  reversed 

counts  as  one  (transposition)  error  and  is  regarded  otherwise  as  a  sequence. 
Examples:  3663  typed,  6336  projected;  3366  typed,  (>633  projected;  3654  typed; 

5436  projected. 


Table  1  (continued) 


LI.  Typing  four  digits  with  some  reversals  of  the  same  projected  digits  counts  as 
two  (transposition)  errors  and  otherwise  is  regarded  as  a  sequence.  Examples 
2415  typed,  1425  projected;  5142  typed,  1425  projected. 

L2.  Typing  four  projected  digits  of  which  three  are  the  same,  so  the  single  digit 
is  tripled,  counts  as  three  (exchange)  errors  and  is  regarded  otherwise  as  a 
sequence.  Example:  3666  typed,  6333  projected. 


The  omission  or  repetition  of  a  row  or  typing  a  row  unrelated  to  the  pro¬ 
jected  row  does  not  constitute  an  error,  but  in  calculating  error  percentage  the 
denominator  must  be  reduced  if  the  row  is  omitted;  or  the  error  total  should  be 
reduced  proportionately.  A  repeated  row  should  be  disregarded.  If  the  typing 
clearly  jumps  within  a  typed  row  from  one  projected  row  to  the  next,  the  typed 
row  will  be  regarded  as  a  legitimate  one  composed  of  the  two  projected  parts,  and 
one  row  will  be  indicated  as  omitted.  Non-digit  characters  per  se  are  not  errors. 


were  fixed  at  5  and  10  seconds,  respectively.  Thus,  it  should  not  be  sur¬ 
prising  that  within  participants  and  between  conditions  across  and  within 
participants  there  was  little  variation  in  the  time  a  participant  took  to 
copy  a  block  of  random  numerals.  Median  durations  were  derived  from  all  five 
blocks  per  participant  in  a  condition.  Across  conditions  the  medians  (of 
block  medians)  varied  only  within  a  9-second  span;  the  median  of  these  was 
3  minutes  and  14  seconds.  The  median  of  the  medians  among  participants  was 
3:12;  individual  medians  ranged  from  2:51  to  3:32,  presumably  reflecting  dif¬ 
ferences  in  typing  speeds  or,  possibly,  further  rehearsal  during  typing. 

Accuracy  of  Performance  (Errors).  As  Table  2  indicates,  the  overall 
mean  error  rates  of  the  three  consequat ion/ feedback  conditions  were  almost 
the  same:  9.5  percent  for  "most  right,",  8.9  percent  for  "some  wrong,"  and 
9.0  percent  for  "some  wrong.  Mistakes .. .most  unfortunate..."  By  inspection, 
these  results  justify  no  assertion  of  different  effects  between  feedback 
conditions  and  fail  to  warrant  statistical  significance  treatment.  The  er¬ 
ror  rates  come  close  to  those  in  the  prior  session  for  the  six  participants 
who  took  part  in  that  (rather  than  in  procedural  exploration  sessions) : 

9.7  percent,  8.4  percent,  and  9.2  percent. 

The  error  percentages  for  the  two  baseline  conditions,  without  feedback, 
were  11.2  (10.8  in  the  prior  session)  at  the  start  of  the  session  and  7.0 
(6.8  in  the  prior  session)  at  the  end.  The  relatively  small  differences  be¬ 
tween  these  and  the  feedback  condition  percentages  appeared  to  have  little 
material  importance  and  if  only  due  to  marked  within-subject-condition  vari¬ 
ance  were  not  examined  for  statistical  significance.  They  may  well  have  re¬ 
flected  improving  performance  during  the  session  as  a  result  of  practice, 
including  procedural  improvements  despite  efforts  to  forestall  this  by  means 
of  prior  and  exploration  sessions.  This  hypothesis  is  supported  by  the  error 
percentages  according  to  the  order  of  condition  encounter:  11.2,  11.0,  9.1, 
7.4,  and  7.0,  as  well  as  by  the  difference  between  a  6.8  baseline  mean  per¬ 
centage  at  the  end  of  the  prior  session  and  a  11.7  percentage  for  the  same 
six  participants  in  the  baseline  mean  percentage  at  the  start  of  the  principal 
session  six  to  14  days  later. 

The  low  error  rate  in  the  final  baseline  condition,  without  feedback, 
may  have  been  due  to  the  cumulative  effects  of  the  preceding  three  feedback 
conditions,  as  well  as,  or  instead  of,  being  due  to  practice.  The  drop  in 
this  final  percentage  prevents  any  assertion  that  feedback  reduced  errors, 
since  it  was  added  as  a  within-subjects  control  condition.  It  could  be  hy¬ 
pothesized  that  it  might  have  risen,  as  it  did  between  sessions,  and  that 
differences  might  have  been  demonstrated  among  feedback  conditions,  if  (1) 
there  had  been  intervals  longer  than  5  minutes  between  conditions,  and  (2) 
there  had  been  more  than  five  blocks  per  condition  (as  in  better  time-series 
experiments)  . 

The  overall  error  percentage  in  the  study,  for  the  principal  session, 
was  9.1.  This  represented  success  in  developing  ways  to  assure  error  rates 
that  were  neither  excessively  low  nor  extremely  high,  l.rror  percentages 
varied  considerably  among  participants,  from  2.1  to  15.6.  The  2.1 -error 
participant  was  assigned  8-numeral  instead  of  9-numeral  rows  through  inves¬ 
tigator  misinterpretation  of  the  ground  rules,  as  that  participant  had  had  a 


Table  2.  Copying  Errors.  Figures  in  parentheses  show  errors  in  prior  session. 


S-nuneral  strings  in  the  prior  session. 


r 


7.6  percent  error  rate  in  the  prior  session  with  8-numeral  rows.  One  par¬ 
ticipant  who  had  a  6.3  percent  error  rate  in  the  prior  session  with  8-nu¬ 
meral  rows  had  a  10.3  rate  with  9-numeral  rows  in  the  principal  session, 
bearing  out  the  conclusion  from  the  procedural  exploration  sessions  that 
error  rate  increased  with  the  number  of  numerals  in  a  row  to  copy  (and  to 
remember  after  a  10-second  interval). 

No  analysis  was  performed  on  the  incidence  of  different  kinds  of  copying 
errors.  Such  an  analysis  (see  Bailey,  1978,  for  a  limited  example)  would 
become  an  appropriate  aspect  of  an  information  processing  investigation  of 
the  origins  of  different  kinds  of  copying  (and  short-term  memory)  errors  in¬ 
volving  random  numbers. 

Self-Report  Data.  Each  participant  responded  to  eight  queries  from  an 
investigator  at  the  end  of  the  participant's  final  session.  The  queries  and 
replies  were: 

(1)  What  did  you  think  was  the  purpose  of  the  experiment  today?  Two 
participants  said  simply  it  was  to  determine  accuracy,  in  clerical  jobs 
(IX)  and  in  remembering  (IV).  The  others  all  referred  in  some  fashion 
to  the  effects  of  the  feedback,  as  "difference  in  accuracy  with  differ¬ 
ent  types  of  evaluation"  (I),  "what  rewards  would  bring  best  efforts" 

(VIII),  "something  to  do  with  positive  and  negative  reinforement- -but  I 
tried  not  to  let  it  interfere"  (VII),  "Effect  of  positive,  negative  re¬ 
inforcement"  (V),  "how  better  one  would  do  with  a  stimulus  like  wrong" 

(III),  "see  if  any  change  occurred  in  the  number  right  or  wrong  after  a 
statement  right  or  wrong"  (II),  "like  something  in  psychology,  stress;  I 
did  react--reacted  more  when  scolded"  (X),  and  "to  see  what  someone  in 
authority  would  say  and  how  it  would  affect  your  work"  (VI).  Thus,  most 
verbalized  the  aim  of  the  study. 

(2)  What  did  you  do  during  the  delay  period,  between  seeing  the  numbers 
and  typing?  Two  said  merely  that  they  "tried  to  memorize"  (VI)  or 'tried 
to  remember"  (IX)  the  numbers.  One  (V)  "figured  out  ways  I  could  remember 
them  best."  One  (VIII)  reported  holding  "a  mental  picture  of  the  numbers," 
and  two  (II  and  VIII)  tried,  respectively,  to  visualize  them  or  the 
easiest  four.  Including  these  two,  six  participants  said  they  repented  the 
numbers  (to  themselves  or  in  their  heads);  for  (VII)  these  were  "the  hard¬ 
est  four."  Three  of  these  said  they  separated  a  row  of  numerals  into  groups 
or  blocks.  One  participant  (III)  reported  turning  some  numbers  into  dates, 
another  (I)  into  dates,  addresses,  and  telephone  numbers,  and  this  last 
reported  also  using  "a  rhythm  technique  and  other  devices,  slowing  the 
rhythm  in  typing,  and  "telling  myself.  Met  it  go'  when  I  made  a  mistake." 

On  the  basis  of  these  self-reports,  one  might  state  that  the  study  in¬ 
vestigated  short-term  memory  rather  than  copying,  which  was  simply  the 
method  of  inquiry. 

(3)  What  was  your  react i on  to  the  comments  shown  you  at  the  end  of  typing 

each  column?  Only  two  participants  reported  that  these  led  to  performing 
differently:  IX  said  "I  tried  to  do  better,  not  make  the  same  mistakes," 

and  IV,  who  encountered  condition  B  second,  said  "At  first  I  felt  good, 
then  not  so  good;  they  made  me  work  harder,  concentrate  more,"  but  II  said 
I'm  not  sure  1  tried  that  much  harder  when  it  said  !  got  some  of  them  wrong.' 


dr 
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Three  reported  a  kind  of  disregard:  X  said  that  "at  first  T  took  it 
literally,  later  I  took  my  mind  off  it  so  I  wouldn't  get  depressed"; 

I  said  "my  thoughts  were  to  disregard  them--with  the  positives  I  was 
anxious  to  get  to  the  next  sequence,  with  the  negatives  I  told  myself 
to  ignore  them--I  got  internally  kranky  about  them--I  was  hungry  and 
thinking  about  things  to  do  at  home--I  wanted  to  finish";  III  said  "I 
thought  it  was  kind  of  funny--I  even  snickered--but  I  realized  it  was 
an  experiment."  Five  participants  expressed  feeling  a  lack  of  congru¬ 
ence  between  the  comments  and  what  they  had  done:  VII  said  "I  didn't 
think  it  always  fit";  VI  said  "sometimes  I  thought  they  didn't  say 
what  I  did";  V  said  "a  couple  of  times  I  thought  I  had  them  all"  (though 
V  did  not);  II  said  "no  matter  what  I  typed  I  could  have  gotten  the  same 
correct";  and  VIII  said  "I  couldn’t  figure  them  out--there  was  no  rela¬ 
tionship  to  what  was  happening;  they'd  appear  the  same  no  matter  what  my 
performance  was."  If  these  self-reports  reflected  the  effects  of  the 
feedback  comments  on  error-making  performance,  little  or  no  impact  would 
be  presumed. 

(4)  Consider  a  7-point  scale  on  which  very  unpleasant  is  1,  very  pleasant 


is  7,  and 

neither  is  4.  Rate  how  you  felt  about  I 

teing  told  (a)  You  got 

i  most  of  them  right;  (b)  You  got  some  of  them  wronj 

’;  (c)  You  got  some  of 

them  wrom 

Mistakes  with  numbers  can  be  most  unfortunate  in  some  cler- 

ical  tasks.  The  results  are  shown  in  Figure  1.  Clearly,  feedback  (a) 
had  no  or  little  reported  hedonic  effect  on  five  participants,  though  the 
reaction  from  IV  is  consistent  with  replies  to  query  3,  and  feedback  (b) 
had  no  or  little  reported  hedonic  effect  on  six  participants.  Feedback 
(c)  had  a  stronger  effect.  Indeed,  III  said  "it  was  threatening,"  but 
VIII  reported  being  mystified  about  the  reference  to  clerical  tasks. 

Participant  VI  said  (a)  "was  nice,"  and  commenting  on  (a)  and  (b)  partic 
pant  VII  reported  relying  more  on  self- judgment ,  "telling  myself  when  I 
didn't  do  well."  By  inspection,  there  appears  to  be  little  relationship 
between  these  ratings  and  error  totals  for  individuals  in  different  feed¬ 
back  conditions,  though  some  relationship  might  have  been  expected. 

(5)  Did  you  try  at  any  time  not  to  make  mistakes?  All  participants  re¬ 
plied  affirmatively  and  four  said  all  the  time,  though  II  indicated  be¬ 
ginning  to  do  so  about  one-third  of  the  way  through  the  session;  partici¬ 
pant  I  denied  trying  harder  at  one  time  than  another  but  said  it  was 
harder  to  concentrate  on  particular  numbers  when  there  were  negative  com¬ 
ments.  In  response  to  Why? ,  three  (I,  V,  and  IX)  mentioned  "challenge," 

I  added  "I  wanted  to  prove  to  myself  I  could  do  it  all  right,"  VIII  said 
"fun,"  X  said  "self-satisfaction,”  and  IX  and  II,  respectively,  "because 

I  don't  like  making  mistakes"  and  "because  basically  I'm  being  tested; 

I'm  conditioned  to  do  things  right."  These  self-reports  could  be  taken 
as  evidence  of  cither  intrinsic  motivation  or  past  conditioning  that 
could  well  have  overridden  any  differential  effects  of  the  feedback  com¬ 
ments.  In  response  to  How? ,  four  participants  (I,  IV,  VIII,  and  IX)  men¬ 
tioned  "concentrating”  (to  which  I  added  "[’.lying  attention  only  to  num¬ 
bers"),  two  (II  and  VI)  mentioned  "trying  hard"  to  remember  or  memorize, 

II  mentioned  repeating  the  numbers,  and  VII  reported  "trying  to  find  the 
best  method  to  keep  numbers  in  memory  longer."  Together  with  the  replies 
to  Question  (?),  it  seems  apparent  that  participants  tried  and  presumably 
succeeded  in  developing  or  instituting  procedures  to  improve  their  memory 
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(and  copying)  performance.  These  procedures  could  include  "concentrating" 
and  "trying  hard"  (terms  that  need  further  analysis).  Such  procedural 
changes  can  be  viewed  as  a  form  of  "learning"  (though  perhaps  different 
from  what  that  term  is  often  assumed  to  mean).  If  this  were  the  case, 
the  design  of  the  study  failed  to  prevent  within-study  learning  from 
confounding  the  effects  of  the  independent  variable  (type  of  feedback). 

(6)  Did  you  ever  type  more  slowly  to  keep  from  making  mistakes?  Only 
two  participants,  1  and  X,  denied  ever  typing  more  slowly,  though  some 
said  doing  so  was  only  occasional,  and  two  reported  they  "must  have"  or 
"might  have."  As  noted  earlier,  the  duration  data  showed  almost  no  vari¬ 
ation  in  the  median  times  taken  to  copy  columns  of  numbers,  between  con¬ 
ditions.  Within  conditions,  durations  of  copying  individual  columns 
rarely  reached  a  difference  of  5  percent  greater  or  less  than  the  median. 

(7)  Did  you  think  any  sets  of  numbers  were  repeated0  Six  participants 
said  they  thought  so,  but  some  thought  there  were  only  a  few  or  once; 
one  said  "no,"  one  didn't  know,  and  a  third  reported  thinking  so  but  be¬ 
ing  unable  to  detect  any  repetitions.  In  actuality,  as  noted  earlier, 
within  cither  the  principal  or  prior  session  there  were  no  repetitions 
of  rows  of  numerals  (though  there  were  chance  repetitions  of  short  se¬ 
quences).  However,  for  the  most  part  the  columns  (blocks)  of  rows  used 
in  the  prior  session  were  used  again,  in  different  conditions,  in  the 
later  one,  at  least  six  days  later. 

(8)  Comments?  Participant  II  expressed  indifference  to  the  negative 
statements,  as  "not  disturbing."  This  individual  reported  having  taken 
a  course  in  introductory  psychology.  Participant  \  said  "it  was  a  very 
good  test,  it  makes  you  think  all  the  time."  In  view  of  the  latter  com¬ 
ment,  together  with  replies  to  Question  f> ,  it  would  seem  that  the  ex¬ 
perimental  situation  hardly  resembled  i cerydav  clerical  or  working  cir¬ 
cumstances.  Perhaps  greater  external  val  idity- -and  possibly  more  dif¬ 
ferential  effects  from  the  kind'  of  feedback- -would  have  resulted  from 
maintaining  each  feednack  condition  f.  r  a  much  longer  time,  as  in  many 
single- subject ,  time-series  designs. 


Discussion  , 

As  noted  at  the  out  set  of  this  report,  this  Copying  studv  was  explora 
tor>’,  in  its  principal  session  as  well  as  sessions  that  wen*  devoted  ex¬ 
plicitly  to  procedural  exploration.  Hue  perhaps  to  the  design  of  the  export 
ment  ,  the  data  gave  no  reason  tor  saving  that  am  -neo-qua!  i  >>n .  t «  edb.n.  1 
condition,  was  more  eltective  than  am  other  tor  reducing  copying  errors,  or 
more  eftcctivo  than  no  feedback ;  for  the  same  reason ,  there  can  be  no  state 
ments  about  ineffectiveness. 

How  might  a  consoquator  ,  lurn’r  error  -prodiu  mg  behavior-'  Vconlmg  to 
reinforcement  principles,  a  t  ivorahle  ,  on  quit  ,»,■  alter  a  p<- rt  or-,  nu  <  error 
should,  bv  itself,  strengthen  that  behavior  make  i!  non  lik,  lv  to  nooiir, 
an  un  t  av  or  at'  1  e  come  qua  t  o  r  should  weikcr.  it  n  a  k  v  it  1  e  .  1  i  k  e  !  \  to  rocai!  . 

But  the  Ci'ii  s(>q  nr  1 1'  r  "m  t  he  eon  t  i  ngi  "  *  >n  t  tie  pe,  ill.  pe  r  I  e  r ■  i. m .  e  it  i  -  -up 
posed  to  lilt  huna  not  a  sern  o  t  a,  t  ion  .  in  wh  l  eh  tin  e  t  mi,  on  to  pm  a 


is  imbedded.  The  contingency  may  be  specified  through  a  discriminator, 
which  identifies  the  erroneous  action,  and  that  discriminator  may  be  pre¬ 
sented  to  the  performer  (or  originate  internally  from  past  experience)  at 
the  same  time  as  or  as  part  of  the  consequator  for  preceding  or  during 
the  performance).  Such  action-produced  discriminators  are  what  is  known 
as  information  feedback  or  knowledge  of  results.  Another  kind  of  discrim¬ 
inator  identifies  the  "correct"  action  to  take;  this,  too,  may  originate 
externally  (or  internally),  after  the  action  (or  before  or  during  it).  A 
different  action  in  the  future  is  one  result,  self-correction  another. 

None  or  little  of  this  analysis  applies  to  the  consequation/feedback 
conditions  in  this  Copying  study.  Verbal  consequators  were  given  at  the  end 
of  a  block  of  80  or  90  numerals  rather  than  for  a  numeral  by  itself.  A  con¬ 
sequator  was  not  contingent  on  a  specific  action  that  re  ulted  in  an  error, 
nor  were  there  external  discriminators  that  identified  those.  (There  un¬ 
doubtedly  were  some  internal  self  corrections  but  fewer  than  those  described 
by  Bailey  (1978),  since  participants  could  not  go  back  and  change  anything 
they  typed.)  Some  of  the  participants,  in  response  to  Question  5  at  the  end 
of  the  principal  session,  expressed  their  concern  that  there  seemed  to  be 
little  relationship  between  the  feedback  and  what  they  had  done. 

On  the  other  hand,  in  accord  with  reinforcement  principles  a  consequator 
may  influence  the  adoption  or  change  of  the  "rules"  that  determine  what  hu¬ 
man  beings  do  in  rule-following  behavior.  Rules  are  procedures.  Procedures 
are  sequences  of  actions  with  sequences  of  discriminators  or  they  may  be  com¬ 
plementary  behavior  that  affects  or  modifies  the  performance  of  direct  in¬ 
terest.  In  this  Copying  study,  according  to  self-reports  participants 
adopted  procedures  to  prevent  mistakes,  such  as  rehearsal  (covert  repetition 
of  numbers  during  the  delay  period),  grouping  (chunking)  of  numerals  (and 
even  rhythm),  visualisation,  and  concentration.  What  explicit  effect  did  the 
consequation/feedback  have  on  adopting  and  following  these  tactics?  Under 
the  circumstances,  neither  a  verbal  conscquation  contingency  nor  an  external 
discriminator  can  be  identified  as  responsible.  The  procedures  may  have  come 
from  prior  experience  in  memorizing  verbal  material.  More  investigation  of 
such  matters  is  needed,  with  reliance  on  performance  data  as  well  as  self- 
report  -  -va  luab  lc  as  the  latter  can  be  for  exploration. 

The  feedback  conditions  may  have  added  to  the  "demand  characteristics" 
of  the  situation,  but  the  potentiation  already  present  for  participant-,  in 
being,  in  a  sense,  "tested"  in  an  experiment  probably  affected  them  in  the 
baseline  conditions  us  well.  It  is  clearly  difficult  to  study  error-making 
performance  in  a  laboratory  situation.  lffects  of  prior  experience  cannot 
be  fully  forestalled.  Students  especially  tend  of  try  to  avoid  making  mis¬ 
takes;  their  academic  environment  requires  sueh  behavior.  Vlditiona]  vari 
ables  may  have  less  observable  impart  than  they  would  in  other  c  i  reumst  unce-- . 


CONSEQUATOR  RATING  STUDY 


It  may  be  possible  to  determine  how  much  influence  a  consequator  would 
exert  over  some  behavior  of  an  individual  by  asking  him  or  her  whether  or 
how  much  he/she  likes  or  dislikes  a  particular  outcome  of  that  behavior,  or 
otherwise  reacts  hedonically  to  an  outcome.  Presumably  if  an  outcome  is  re¬ 
ported  as  liked  or  wanted,  or  as  making  a  person  happy,  it  will  function  as 
a  favorable  consequator  and  make  the  behavior  more  likely  to  reoccur.  If  it 
is  reported  as  disliked  or  rejected  or  as  making  a  person  unhappy,  it  will 
function  as  an  unfavorable  consequator  and  make  the  behavior  less  likely  to 
reoccur.  The  technique  of  asking  people  what  they  want  has  been  used  in  some 
token  economy  studies  to  determine  what  tokens  should  be  exchangeable  for, 
and  how  many  tokens  per  ultimate  consequator.  However,  it  does  not  seem  to 
have  been  exploited  to  find  out  what  could  "punish"  or  suppress  some  behavior, 
or  to  discover  outcomes  people  might  try  to  avoid.  Nor  has  it  been  investi¬ 
gated  with  respect  to  verbal  consequators ,  favorable  or  unfavorable. 

To  explore  reactions  to  such  potential  consequators,  seven  working  women 
were  asked  to  rate  how  unhappy  or  happy  they  would  feel  about  another  per¬ 
son's  verbal  comments  concerning  a  task  they  had  performed.  The  participants 
all  worked  in  various  offices  in  a  technical/industrial  park  in  a  Washington, 
D.C.  suburb,  in  clerical  or  supervisory  positions.  Their  ages  varied  from  the 
twenties  to  the  fifties.  They  were  individually  queried  after  receiving  writ¬ 
ten  instructions.  Anonymity  was  assured,  and  they  signed  informed  consent 
forms . 

They  were  shown  a  graphic,  bimodal,  9-point  (9-digit)  rating  scale  on 
which  the  end  digit  1,  always  on  the  left,  meant  that  the  comment  "would  have 
made  you  feel  extremely  unhappy,"  the  end  digit  9,  on  the  right,  meant  that 
the  comment  "would  have  made  you  extremely  happy,"  and  the  middle  digit  F>  meant 
that  "it  wouldn’t  have  made  you  feel  one  way  or  the  other."  A  participant 
selected  and  wrote  down  on  a  separate  blank  sheet  of  paper  a  rating  number  for 
each  item,  working  at  her  own  pace  but  being  timed.  There  were  12)  items 
listed  alphabetically  in  columns  on  other  sheets  of  paper.  They  are  listed  by 
ratings  in  Table  d,  but  their  alphabetical  order  in  the  study  indiscriminately 
mixed  presumably  unfavorable,  neutral,  and  favorable  comments.  The  items  were 
drawn  from  a  thesaurus  and  from  the  investigator's  own  head.  Ihey  included 
adjectival  statements  and  phrases,  nouns,  and  exclamations,  some  adxersely 
critical,  some  complimentary,  some  neither.  Some  were  oriented  primarily  to- 
wardthe  task  itself  and  its  outcome,  some  toward  the  individual's  personality 
characteristics  as  evidenced  in  task  performance. 

There  were  two  tasks  to  consider  (  separ.it  e  1  y  i  :  (It  perturming  a  job  at 
work,  and  (21  taking  a  test  at  school.  lor  the  first,  a  pa rf i c i punt  was  told 
to  "Suppose  you  asked  an  associate  whose  iiidcment  you  ia  .pu  tn!  to  ,  onneiit  on 
a  job  sou  had  tost  done,  and  the  assoc i  it  e  made  one  o:  the  tollowinr  .onrunt 
for  t  he  second,  the  participant  was  told  to  "suppo  *dn  n  sou  »eie  i  student 
you  t  ook  a  test  and  the  teacher,  using  some  of  iccl  i u  irttili on.  madi  one  ol 
the  following  comment s . ” 


further,  each  of  tins,  t  ok  had  t  hive  i  oik!  it  ion  t  oi  *  ...  h  pa  i  t  i  •  i  pant 
(11  there  was  no  men  t  l  on  whether  it  wo-  >  a  ■  \  or  .1 1  t  i  i  i  u  1  t  , 


i  I  w  a 


Table  3.  Median  ratings  of  happiness-unhappiness  about  an 

associate's  comments  on  performance  of  a  difficult 
job.  (Rating  of  1  means  extremely  unhappy,  of  9 
extremely  happy,  of  5  neither  unhappy  nor  happy.) 


Rating  for 

Rating  for 

Difficult  Job 

Comment 

Easy  Job 

Remarks 

1 

Bad  stuff 

1 

Unanimous  (E) 

1 

It's  deplorable 

1 

(b).  (g) 

1 

It's  disappointing 

1 

(g) 

1 

Disapproved 

1 

(g) 

1 

It's  discreditable 

1 

(e),  (g) 

1 

It's  disgraceful 

1 

1 

It's  dreadful 

1 

(b)  Unanimous  (E) 

1 

You're  dumb 

1 

(b)  Unanimous  (E) 

1 

Dummy '. 

1 

Redundancy,  Unanimous  (E) 

1 

It's  a  failure 

1 

1 

It's  frightful 

1 

(b),  (g) 

1 

It's  ghastly 

1 

(a),  (b) 

1 

Horrible! 

1 

Unanimous  (E) 

1 

Idiot 

1 

Unanimous  (E,  D) 

1 

It's  idiotic 

1 

Redundancy,  Unanimous  (D) 

1 

Imbecile! 

1 

Unanimous  (E,  D) 

1 

It's  imbecilic 

1 

Redundancy,  Unanimous  (E,  D) 

1 

It's  incompetent 

1 

(g) 

1 

It's  lousy 

1 

1 

Moron! 

1 

Unanimous  (E,  D) 

1 

It's  moronic 

1 

Redundancy,  Unanimous  (E.D) 

1 

Poorly  done 

1 

(g) 

1 

Poor  stuff 

1 

(g) ,  Unanimous  (E) 

1 

It's  ridiculous 

1 

• 

1 

Shame  on  you! 

1 

1 

It's  stupid 

1 

1 

You're  stupid 

1 

Redundancy,  Unanimous  (E) 

1 

It's  terrible 

1 

Unanimous  (1.) 

1 

It's  very  bad 

1 

Unanimous  (E) 

1 

It's  worse  than  usual 

1 

(c),  (g) 

1 

You're  wrong 

1 

(e) ,  (g) 

2 

It's  absurd 

2 

It's  aw  f u 1 

1 

(a) 

2 

Below  par 

1 

You're  crazy 

1 

*> 

It's  defective 

1 

la) 

2 

It's  disreputable 

1 

■> 

You're  foolish 

It's  i rrat i ona I 

1 

■> 

You're  nutty 

T 

It's  pi t i ab 1 c 

1 

l a  1  ,  (gl 

Pretty  bud 

) 

•> 

It's  unsucce -■  s  t  u ! 

!>  O'  O'  O'  invvvtviviviwuitnuuiuiunui  enenWWWWOiWWWCrtWWW 


Table  5,  Continued  (2) 


Rating  for 
Difficult  Job 


Comment 

It's  false 
It's  faulty 
It's  imperfect 
It's  inaccurate 
It's  inadequate 
It's  incorrect 
It's  insufficient 
It's  inferior 
You're  obtuse 
It's  second  rate 
You're  silly 
It's  so  so 
It's  unfortunate 
It's  unobjectionable 

It's  contemptible 
It's  mediocre 
You're  mistaken 
No  comment 
Not  good 

It's  reprehensible 
It's  tolerable 
It's  uneven 

It's  accurate 
It's  adequate 
It's  average 
It's  competent 
It's  creditable 
It's  fair 
It's  fine 
It's  good  enough 
It's  laudable 
It's  middling 
It's  normal 
O.K. 

It's  ordinary 
It's  rational 
It's  sound 

You're  astute 
Better  than  usual 
It’s  correct 
It's  effective 
It's  encouraging 
It's  n i ce 
Not  bad! 

You're  right 
Thanks 


Rating  for 
Easy  Job 


Remarks 


(e) ,  (h) 

(f) 


(a),  (c),  (d),  (e) 


(b),  (g) 

(a),  (c) 

(a),  (c) 

(g) 

(a),  (c) 

(h) 

(a),  (b),  (e) 
(a),  (b),  (d) 

(f) 


(a),  (e) 

(a).  (O,  (d) 


-  *  • 


I 


Table  3,  Continued  (3) 
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Rating  for  Rating  for 

Difficult  Job  Comment  Easy  Job 


Remarks 


7 

It's  distinguished 

7 

(b) 

7 

Good  for  you 

7 

7 

Pretty  good 

6 

7 

It's  true 

5 

(a)  ,  (h) 

7 

It's  worthy 

5 

8 

Above  par 

5 

(a)  ,  (c) 

8 

It's  admirable 

7 

8 

Approved 

5 

(a)  ,  (c)  ,  (d) 

8 

It's  commendable 

6 

8 

It's  exemplary 

5 

(e) 

8 

It's  first  rate 

7 

(a) 

8 

Good  stuff 

7 

8 

It's  perfunctory 

5 

(a)  ,  (c) 

8 

It's  praiseworthy 

6 

8 

You're  sharp 

7 

8 

Well  done 

7 

1 


9 

Bravo! 

9 

You're  bright 

9 

It's  brilliant 

9 

You're  brilliant 

9 

It's  excellent 

9 

Genius .' 

9 

It's  grand 

9 

Great! 

9 

It's  magnificent 

9 

It's  meritorious 

9 

It's  perfect 

9 

It's  splendid 

9 

It's  a  success 

9 

It's  superb 

9 

It’s  superior 

9 

It's  superlative 

9 

It's  tip  top 

9 

It's  very  good 

9 

It's  wonderful 

KEY  to  Remarks 


9 

7 
9 
9 
9 
9 

8 
9 

7 
6 
9 

8 
6 
9 
9 
8 
8 
8 
8 


Redundancy , 


(f)  , 


Unanimous 


(D) 


(f) ,  Unanimous  (D) 

(f) 

(a) 

(b) 

(f) 

(a)  ,  (hi 

(f)  ,  Unanimous,  (D) 
Unanimous,  (D) 

(h) 


H 

3 

t 


At  least  one  1.  rating  is  1  places  from  the  median. 
At  least  one  P  rating  is  at  least  1  places  t  rom  th 


mod i an . 


c 

There 

are 

extreme  1  rating 

s  abo\ 

e  and  he  Ion  tin'  med 

i  an 

d 

There 

a  re 

extreme  P  rating 

s  aho\ 

e  and  below  the  med 

i  an 

c 

There 

a  re 

a  rat i  ngs  abov e 

the  1 

med i an ,  none  he  1 ow 

i  t  . 

f 

There 

a  re 

a  ra t i ngs  he  1 ow 

the  1 

med ian,  none  ahov e 

i  t  . 

g 

1  here 

a  re 

a  rat  t  nr.-'  ahm  e 

the  P 

med  i  m  ,  none  he  1 1  >u 

i  t  . 

h 

There 

are 

a  rat i ngs  he  1  oh 

the  P 

mod  i  ill ,  none  abov  e 

i  t  . 

E 

Easy 

h)fi 

ra  t i ngs . 

[> 

I>i  ff i 

cu  1  r 

dob  rat i ngs . 
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described  as  easy  ("Now  suppose  you  thought  the  job  (test)  was  an  easy  one")  ; 
or  (3)  it  was  described  as  difficult  ("Now  suppose  you  thought  the  job  (test) 
was  a  difficult  one").  Each  participant  went  through  the  list  of  124  items 
at  least  six  times,  once  for  each  of  the  three  conditions  in  each  of  the  two 
tasks.  The  three  conditions  for  the  Job,  in  the  order  of  no  mention,  easy, 
and  difficult,  were  followed  by  those  for  the  Test,  in  the  order  of  no  men¬ 
tion,  difficult,  and  easy.  In  addition,  the  no-mention  condition  for  the  Job 
was  repeated  with  six  participants,  in  five  cases  after  the  Test  portion,  in 
one  between  the  Job  and  Test  portions;  and  Job  Difficult  was  repeated  with 
one  of  the  participants,  at  the  end. 

Results 

Durations.  The  average  duration  for  completing  the  list  of  124  items 
was  about  six  minutes  but  individuals  varied  greatly.  The  slowest  respondent 
took  about  13  minutes  for  the  first  set  of  reactions  to  the  124-item  list, 
and  about  7.5  minutes  for  the  last;  the  fastest  respondent  took  5  and  4  mi¬ 
nutes.  Durations  were  reduced  by  20  to  SO  percent  between  first  and  last, 
with  all  participants  speeding  up  during  the  session. 

Reliability.  The  first  encounter  with  the  list  of  items  and  rating  them 
was  the  condition  of  Job  with  no  mention  of  ease  or  difficulty.  The  data 
were  not  used  except  to  compare  them  with  the  later  Job  No-Mention.  In  that 
comparison,  47  percent  of  the  744  ratings  of  items  for  six  participants  dif¬ 
fered  in  the  later  set  from  the  earlier;  23  percent  of  the  differences  were 
greater  than  one  place  on  the  scale.  There  were  approximately  the  same  num¬ 
ber  of  increases  in  ratings  (25  percent)  as  decreases  (22  percent).  Among 
individuals  the  differences  ranged  from  30  percent  (with  14  percent  greater 
than  one  place)  to  65  percent  (with  48  percent  greater  than  one  place).  When 
one  subject  repeated  the  Job  Difficult  ratings,  there  were  22  percent  fewer 
differences  than  there  had  been  between  the  two  Job  No-Mention  conditions. 

It  might  be  presumed  that  the  first  Job  No-Mention  condition,  with  which  a 
session  began,  was  relatively  unreliable,  and  that  reliability  subsequently 
increased.  Due  to  the  remaining  unreliability,  inter-subject  variance,  and 
small  number  of  participants,  the  study  must  be  viewed  as  exploratory.  Anal¬ 
ysis  of  statistical  significance  was  not  undertaken. 

t 

Item  Ratings.  The  median  (between  participants)  ratings  for  the  124 
items  in  the  Job  Difficult  condition  are  shown  in  Table  3.  The  table  also 
shows  the  ratings  for  the  Job  Easy  condition.  The  remarks  column  contains 
a  number  of  data  modifiers  or  cautions.  In  cases  of  (a)  and  ( b ) ,  where  at 
least  one  E  rating  or  D  rating  is  4  places  from  the  median,  and  in  cases  of 
(c)  and  (d)  where  there  are  extreme  E  and  D  ratings  above  and  below  the  median, 
it  may  be  conjectured  that  the  item  presented  some  sort  of  difficulty  to  one 
or  more  participants,  resulting  in  a  deviant  rating.  In  any  case,  one  or 
more  participants  had  an  idiosyncratic  interpretation  of  the  item.  Instances 
of  (c),  ( f )  ,  (g),  and  (h)  indicate  there  would  be  somewhat  different  results 
if  means  had  been  derived  instead  of  medians:  (e)  and  (g)  indicate  that  the 
mean  rating  would  be  higher,  (f)  and  (g)  that  it  would  lie  lower.  These  modi¬ 
fiers  can  lie  related  to  differences  between  Easy  and  Difficult  ratings  (see 
below).  The  (a),  (b) ,  (c),  and  (d)  indicators  should  caution  about  the  relia¬ 
bility  of  these  items  in  any  future  use. 
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Some  comments  that  might  be  presumed  to  be  just  information-carriers 
may  be  seen  to  occasion  self-reports  of  extreme  unhappiness.  These  include 
"Wrong,"  "Incorrect,"  and  "False,"  though,  as  noted  below,  their  antonyms 
are  relatively  neutral.  "Failure"  and  "Success"  both  exert  strong  hedonic 
effects. 

Job  Easy  vs.  Job  Difficult.  Among  the  124  items,  there  were  62  dif¬ 
ferences  (50%)  between  medians  for  Job  Fasy  and  medians  for  Job  Difficult. 

Among  these  62,  90  percent  (56)  were  instances  where  the  rating  was  higher 
in  Job  Difficult  than  in  Job  Fasy.  Among  these  56,  28  were  ratings  of  com¬ 
ments  that  could  be  called  favorable  (rated  above  5  in  Job  Difficult),  24 
were  ratings  of  unfavorable  comments  (below  5),  and  4  were  ratings  of  neutral 
comments  (at  5).  Thus,  higher  ratings  for  unfavorable  plus  neutral  comments 
were  just  as  many  as  higher  ratings  for  favorable  comments.  A  comment  that 
was  favorable  for  task  outcome  when  the  task  was  easy  was  more  so  than  when 
the  task  is  difficult;  the  individual  reported  feeling  happier.  A  comment  that 
was  unfavorable  for  task  outcome  when  the  task  was  easy  was  less  so  when  the 
task  was  difficult;  the  individual  reported  feeling  less  unhappy.  Unfavor¬ 
able  comments  do  not  become  more  unfavorable  when  a  task  is  more  difficult, 
according  to  the  results  of  this  self-report  inquiry. 

The  above  summary  is  borne  out  by  inspection  of  the  data  in  Table  3. 

It  will  be  seen  that  the  median  item  ratings  for  Job  Difficult  are  generally 
either  higher  than  or  the  same  as  those  for  Job  Easy;  very  few  are  lower. 
Although  higher  ones  do  not  appear  when  the  Job  Difficult  median  ratings  are 
1,  the  modifier  (g)  for  ten  of  these  indicates  that  the  Job  Difficult  ratings 
should  be  considered  in  the  light  that  these  had  three  ratings  above  the 
median  and  none  below  it  and  thus  the  means  would  be  higher  than  1.  Con¬ 
versely,  the  five  (f)  modifiers  for  items  rated  9  for  Job  Difficult  indicate 
that  the  corresponding  9  ratings  for  Job  Easy  should  be  considered  in  the 
light  that  these  had  three  ratings  below  the  median  and  none  above  it  and  thus 
the  means  would  be  lower  than  9. 

Nevertheless,  many  of  the  unfavorable  comments  rated  at  1  were  resistant 
to  change  from  the  easy  job  to  the  difficult  job,  and  on  some  of  these  the 
respondents  were  unanimous  in  both  the  Easy  and  Difficult  ratings.  Examples 
were  "Idiot,"  "Imbecile,"  and  "Moron."  Perhaps  these  were  more  stable  be¬ 
cause  they  were  personality-oriented  rather  than  task-oriented.  At  the  other 
end  of  the  rating  scale,  there  were  several  favorable  consequators  that  were 
rated  9  for  both  easy  and  difficult  jobs,  without  an  (f)  modifier:  "Bravo!," 
"It's  excellent,"  and  "It's  superior."  But  none  of  these  was  rated  9  unani¬ 
mously  for  the  Easy  job.  If  one  had  been,  a  mean  rating  could  not  have  gone 
higher  for  the  Difficult  job.  Those  showing  the  greatest  difference  were 
"It's  meritorious"  and  "It's  a  success."  Some  stability  was  demonstrated  for 
"Neither"  (neutral)  ratings.  Of  15  rated  at  5  for  Difficult,  12  had  been  so 
rated  for  Easy.  However,  of  25  rated  at  5  for  Easy,  12  were  rated  higher  for 
Difficult  (and  one  lower) --another  indication  of  the  trend  to  rate  the  comments 
higher  for  Difficult  than  for  Easy. 

Test  Easy  vs.  lest  Difficult.  Of  8(>S  ratings  by  the  seven  participants, 

46  percent  (396)  were  different  for  these  two  task  conditions.  Of  t  lie  396, 

87  percent  received  higher  ratings  in  lest  Difficult.  All  but  one  of  the 
participants  produced  higher  ratings  in  Test  Difficult,  the  individual  in¬ 
creases  ranging  from  79  percent  to  98  percent.  Though  tiiese  figures  are  not 


directly  comparable  with  the  median  data  for  Job  Easy  vs.  Job  Difficult,  the 
87  percent  almost  matches  the  90  percent  for  differences  between  medians, 
and  the  46  percent  comes  close  to  the  50  percent  of  medians  where  there  were 
differences.  Thus,  the  Test  ratings  supported  the  findings  for  Job  ratings; 
as  task  difficulty  increases,  the  same  comments  arc  reported  as  making  a  per¬ 
son  happier  or  less  unhappy. 

Job  Difficult  vs.  Test  Difficult.  When  ratings  for  Job  Difficult  were 
compared  with  those  for  Test  Difficult,  of  868  paired  ratings  across  seven 
participants,  393  (45  percent)  were  different.  Of  these,  65  percent  (256) 
were  higher  on  the  Test  than  on  the  Job,  and  35  percent  lower.  When  the 
actual  differences  in  scale  placement  were  summed  for  the  593  pair  differ¬ 
ences,  the  total  was  686,  and  69  percent  of  this  total  (473)  came  from  ratings 
higher  on  the  Test  than  on  the  Job.  All  seven  participants  had  more  ratings 
that  were  higher  on  the  Test  than  on  the  Job;  six  produced  the  same  result 
when  placement  differences  were  totalled,  and  one  was  even.  For  46  items, 
there  were  at  least  two  more  participants  rating  an  item  higher  on  the  Test 
than  those  rating  it  lower,  or  vice  versa,  and  of  the  46,  42  (91  percent) 
were  instances  of  higher  rating  on  the  Test.  Among  these,  19  pertained  to 
favorable  comments  (rated  higher  than  5),  17  to  Unfavorable  (lower  than  5) 
and  6  to  Neutrals  (5).  This  latter  finding  indicates  that  higher  ratings 
for  the  Test  situation  tended  to  be  higher  whether  the  comment  was  derogatory 
(unfavorable)  or  complimentary  (favorable). 

Individual  differences  were  considerable.  They  ranged  from  27  percent 
more  for  Test  (34)  to  66  percent  (82)  out  of  124  possibilities  per  partici¬ 
pant  (and  from  56  to  144  for  summed  differences  in  scale  places) .  The  par¬ 
ticipant  with  the  smallest  preponderance  of  higher  ratings  for  Test  had  53 
percent  more  than  for  Job;  the  one  with  the  largest  margin  had  S9  percent 
more  for  Test.  The  mean  extent  of  place  differences  among  participants 
ranged  from  1.2  places  to  2.6. 

The  requirement  to  report  Test  ratings  called  for  a  more  retrospective 
self-report  than  did  Job  ratings,  and  the  source  of  the  consequators  (com¬ 
ments  about  the  task)  was  a  teacher  rather  than  an  associate.  The  differ¬ 
ences  in  ratings  between  Job  Difficult  and  Test  Difficult  may  have  been  due 
to  these  variations.  What  seems  more  striking,  however,  is  the  way  in  which 
the  results  for  two  different  tasks  tended  to  parallel  those  comparing  easy 
and  difficult  conditions  within  tasks.  Possibly  the  Test  situation  was  re¬ 
acted  to  as  more  difficult  than  the  Job  situation. 

In  any  case,  although  a  detailed  analysis  remains  to  be  performed  on  the 
Test  ratings,  they  appeared  to  resemble  those  for  the  Job  data.  Of  particu¬ 
lar  interest  is  the  finding  that  ratings  were  higher  on  the  Test  than  on  the 
Job  for  unfavorable  as  well  as  favorable  comments.  Ratings  did  not  go  down 
for  unfavorable  comments  and  up  for  favorable  ones,  as  might  have  been  sus¬ 
pected  in  comparing  two  types  of  tasks. 

Distribution  of  Ratings  among  Oonseqpiat tprs.  The  number  of  comments  re¬ 
ceiving  Job  Difficult  ratings  from  1  through  4  exceeded  the  number  receiving 
ratings  from  (>  through  9,  65  to  44.  This  disparity  may  have  been  due  to  se¬ 
lecting  more  unfavorable  than  favorable  comments  in  the  first  place,  ami  in 
turn  that  selection  might  have  resulted  from  a  greater  proportion  of  adverse 
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terms  in  the  English  language  and  thus  in  any  thesaurus.  Such  a  dispropor¬ 
tion  is  suggested  by  the  disproportion  of  hedonic  terms  in  Roget's  Tnter- 
national  Thesaurus  (Third  Edition).  Under  "pleasantness"  are  47  lines  of 
nouns,  verbs,  adjectives,  and  adverbs,  but  under  "unpleasantness,"  90  lines; 
under  "pleasure,"  there  are  82  lines,  but  under  "displeasure,"  149.  There 
is  no  section  for  "like,"  but  under  "dislike"  there  are  41  lines.  In  addi¬ 
tion,  there  are  25  lines  under  "approach"  but  106  under  "avoidance,"  not  to 
mention  47  under  "escape,"  40  under  "abandonment,"  and  23  under  "rejection." 
(However,  one  finds  167  lines  under  "desire."  "Aversion"  is  listed  under 
"dislike.")  More  inquiry  seems  warranted  as  to  the  distribution  of  favorable 
and  unfavorable  verbal  consequators  in  spoken  and  written  English  (and  other 
languages). 


There  were  almost  as  many  median  ratings  (50)  at  the  extreme  places  on 
the  scale--l  and  9--as  in  intermediate  places  (59),  excluding  the  central 
place,  5.  This  distribution  too,  may  reflect  the  selection  process,  which 
in  turn  may  reflect  language  practice.  Or  it  may  result,  at  least  in  part, 
from  behavior  in  using  a  9-point  rating  scale,  especially  with  regard  to  the 
small  number  of  4  and  7  ratings. 


Antonyms .  Interesting  comparisons  may  be  made  among  antonyms.  For  ex¬ 
ample,  for  Job  Easy,  "Approved"  was  rated  5  (i.e.,  making  the  respondent 
neither  happy  nor  unhappy),  whereas  "Disapproved"  was  rated  1  (i.e.,  making 
the  respondent  extremely  unhappy.)  The  same  contrast  occurred  with  "It's 
correct"  and  "It's  incorrect,"  "It's  rational"  and  "It's  irrational,"  It's 
true"  and  "It's  false,"  "It's  competent"  and  "It's  incompetent,"  "Above  par" 
and  "Below  par,"  "It's  creditable"  and  "It's  discreditable,"  and 'It's  adeauato" 
and  "It's  inadequate."  A  tendency  to  rate  the  "positive"  term  at  or  close  to 
a  neutral  position  while  its  "negative"  antonym  remained  at  the  extreme  end 
of  the  scale  was  evident,  to  a  smaller  degree,  among  Job  Difficult  ratings  and 
among  other  consequators,  such  as  "You're  right"  vs.  "You're  wrong"  (6  and  1 
for  both  Job  Easy  and  Job  Difficult),  "It's  accurate"  vs.  "It's  inaccurate" 

(7  and  1  for  Job  Easy),  "Well  done"  vs.  "Poorly  done”  (7  and  1  for  Job  Easy, 

8  and  1  for  Job  Difficult),  "Pretty  good"  vs.  "Pretty  bad"  (6  and  1  for  Job 
Easy),  "Better  than  Usual”  vs.  "It's  worse  than  usual"  (7  and  1  for  Job  Easy 
and  6  and  1  for  Job  Difficult),  "It's  a  success"  vs.  It's  a  failure"  (6  and  1 
for  Job  Easy),  and  "Good  stuff"  vs.  "Bad  stuff"  (7  a<nd  1  for  Job  Easy  and  8 
and  1  for  Job  Difficult).  Studies  which  have  used  "right"  and  "wrong"  (or 
similar  terms)  for  information  feedback  about  performance  presumably  have  as¬ 
sumed  equivalence  in  their  hedonic  (and  conscquat ion)  status,  if  they  have  made 
any  such  assumptions  at  all,  but  these  appear  to  be  questionable.  In  partic¬ 
ular,  many  terms  that  might  be  assumed  to  be  hedoni cully  positive  (and  thus 
favorable  consequators)  turned  out,  in  this  study,  to  be  hedonieally  neutral, 
or  close  to  it,  especially  when  respondents  reacted  to  the  task  as  an  easy  one; 
these  terms  included  "It's  fine,"  "O.K.,”  "It's  correct,"  "You're  right,"  and 
"Thanks . " 


Task  vs.  Personality  Terms.  The  extreme  reactions  to  personality  terms 
such  as  "Idiot”  (and  also  "Genius")  have  already  been  noted.  The  "It's"  items 
might  be  regarded  as  task  modifiers  and  the  "You're"  terms  (including  those 
so  implied,  such  as  "Moron."')  as  person  modifiers.  The  comments  presented  to 
participants  failed  to  make  it  possible  to  compare  systematically  these  two 
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types.  Indeed,  those  that  the  investigator  had  supposed  to  be  hedonically 
neutral  and  that  were  rated  as  such  were  all  "It's"  comments.  The  distinc¬ 
tion  deserves  further  inquiry. 

Individual  Differences.  With  a  sample  of  only  seven  participants  not 
too  much  can  be  said  about  differences  between  individuals,  except  that  even 
such  a  small  sample  revealed  substantial  differences.  Earlier  summaries  of 
data  have  indicated  some  of  these.  Some  were  probably  due  to  initial  un¬ 
familiarity  with  the  rating  scale.  The  ratings  for  .Job  Easy,  which  was  rated 
before  Job  Difficult,  had  26  instances  of  at  least  one  rating  four  places  from 
the  median  among  participants,  whereas  the  Job  Difficult  ratings  had  only  8, 
and  there  were  10  instances  of  extreme  ratings  both  above  and  below  the  median 
for  the  Job  Easy  task,  4  for  Job  Difficult.  One  participant  showed  a  strong 
tendency  to  rate  items  9  when  others  rated  them  lower.  Some  rated  an  item  9, 
others  5.  Some  terms  may  have  created  rating  problems  by  being  relatively 
obscure  to  some  individuals,  such  as  "ghastly,"  "laudable,"  "perfunctory," 
and  "reprehensible."  Occasional  individual  ratings  contrasted  markedly  with 
those  of  other  participants,  such  as  a  1  for  "It's  magnificent,"  and  a  9  for 
"It's  unobjectionable."  Comments  that  produced  idiosyncratic  responses  or 
marked  variance  in  responses  should  be  purged  from  the  list  for  any  subsequent 
use. 

Discussion 

Though  this  rating-scale  study  has  not  demonstrated  that  verbal  reactions 
by  someone  else  to  a  person's  task  performance  can  function  as  verbal  conse- 
quators  to  influence  the  individual's  subsequent  performance,  it  opens  the  way 
for  investigating  that  hypothesis.  As  common  experience  tells  us,  it  was 
shown  that  such  verbal  reactions  differ  in  the  extent  to  which  they  affect 
people  hedonical ly ,  that  is,  the  ways  and  levels  in  what  people  say  they  feel. 
The  empirical  question  is  the  degree  to  which  such  verbal  comments  exert  in¬ 
fluence  as  corsequators  commensurate  with  their  influence  on  hedonic  self- 
reports.  Self-report  measurement  through  rating  scales  can  provide  some  basis 
for  selecting  potential  verbal  consequators  to  investigate  this  question. 

Although  this  exploratory  study  should  be  replicated  on  a  wider  scale, 
with  more  participants  and  some  revisions  in  the  verbal  reactions,  the  present 
study  has  suggested  several  significant  factors  to  consider  both  in  a  more 
definitive  inquiry  and  in  experimentation  on  the  behavioral  influence  of  ver¬ 
bal  consequators. 

(1)  A  comment  from  another  person  will  be  reported  as  arousing  different 
amounts  of  feeling  according  to  the  stated  (and  perhaps  experienced! 
difficulty  level  of  the  task  toward  which  tiie  comment  has  been  directed. 

(2)  The  same  complimentary  comment  will  produce  greater  reported  hap¬ 
piness  when  the  task  was  a  difficult  one  than  when  it  was  an  easy  one. 

(3)  The  same  derogatory  comment  will  produce  less  reported  unhappiness 
when  the  task  was  a  difficult  one  than  when  it  was  an  easy  one. 


•1 


(4)  The  same  comment  can  produce  differences  in  reported  happiness  or 
unhappiness  when  tasks  themselves  differ  (though  this  conclusion  was 
confounded,  in  this  study,  by  a  difference  between  the  sources  of  com¬ 
ments  as  well--a  variable  that  merits  further  investigation). 


I 
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(5)  Some  terms  (e.g. ,' "wrong")  that  have  been  employed  as  information 
feedback  in  studies  of  the  effects  of  knowledge  of  results  seem  to  ex¬ 
ert  strong  hedonic  effects--which  may  or  may  not  have  confounded  the 
information  variables  in  those  studies,  since  their  antonyms  (e.g., 
"right")  appear  to  have  little  or  no  hedonic  influence.  In  short, 
antonymous  verbal  reactions  do  not  necessarily  exert  equivalent  hedonic 
impact,  partly  because  the  complimentary  member  of  the  pair  may  produce 
little  of  this  whereas  the  derogatory  member  may  exert  a  great  deal. 

How  general  this  bias  is  remains  to  be  ascertained. 

(6)  People  differ  considerably  in  their  rating  reactions  to  the  verbal 
reactions  of  others  about  their  task  performance,  even  within  a  small 
group  of  working  women.  The  causes  of  such  differences  presumably  ex¬ 
tend  beyond  differences  in  handling  a  rating  scale  and  in  familiarity 
with  terms. 

Rating  scales  of  potential  verbal  consequators  might  serve  another  pur¬ 
pose  besides  the  experimental  investigation  of  conscquation  variables.  They 
might  be  able  to  distinguish  among  individuals  with  respect  to  their  sus¬ 
ceptibility  to  such  variables,  that  is,  the  extent  to  which  such  variables 
influence  performance  differentially  among  individuals.  Thus,  they  could  con¬ 
stitute  a  personality  test  for  measuring  a  significant  human  attribute  that 
has  only  recently  begun  to  get  attention  among  investigators  of  human  differ¬ 
ences,  as  in  studies  of  internal-external  locus  of  control.  Ratings  of  verbal 
consequators  should  be  related,  through  studies  with  large  samples  of  partic¬ 
ipants,  to  demographic  and  other  personality  variables. 

Because  individuals  can  apply  verbal  consequators  to  themselves,  and 
probably  often  do  so  covertly,  rating  scales  such  as  those  in  this  study  might 
also  be  used  to  examine  self-worth,  self-esteem,  and  self-concept  as  these 
vary  in  task  performance.  Likewise  it  is  possible  they  could  be  adapted  to 
personnel  selection  in  organizations. 


-26- 

MAPPING  OF  CONSEQUATION  VARIABLES 


One  of  the  important  by-products  of  experimental  research  is  the  cog¬ 
itation  that  results  from  hands-on  work  in  addition  to  arm-chair  verbali¬ 
zation  and  along  with  the  intellectual  exploration  that  accompanies  reading 
the  research  literature.  Although  consequation  has  often  been  conceptual¬ 
ized  as  a  fairly  simple  construct  (Law  of  Hf feet ,  Reinforcement,  or  Knowl¬ 
edge  of  Results),  anyone  starting  to  do  experimental  research  in  this  field 
rapidly  discovers  it  covers  a  very  large  number  of  variables  and  states  of 
variables.  The  present  investigator  discovered  also  that  the  domain  had 
never  been  fully  mapped--that  is,  all  the  related  variables  and  their  states 
had  never  been  listed  in  one  publication,  though  Meister  (1976)  and  Pritchard 
and  Mantagno  (1978)  have  gone  further  than  most. 

It  seems  important  to  have  such  a  map  so  one  will  know  where  one  is 
going  and  not  get  lost--that  is,  know  what  variables  might  be  considered  for 
manipulation  and  what  ought  to  be  controlled.  In  the  course  of  the  present 
project  all  probable  consequation  variables  and  their  states  were  collected 
from  a  large  variety  of  sources,  carefully  categorized,  and  published  in  a 
technical  report,  Variables  in  human  consequation/fecdback .  This  report  con¬ 
sisted  principally  of  four  comprehensive  tables.  Since  the  technical  report 
is  readily  available,  its  contents  will  not  be  reproduced  here  except  to  list 
the  major  headings. 


Table  1:  Consequation/Feedback 

A.  Extent,  Amount  (7  subdivisions,  17  components) 

B.  Comparison  (4  subdivisions,  IS  components) 

C.  Type  (2  subdivisions,  9  components) 

D.  Source  (5  subdivisions,  11  components) 

Table  2:  Action  Relationships 

A.  Purposes,  Effects  of  Consequation/Feedback  (4  sub¬ 
divisions,  19  components) 

B.  Action  Aspects  (6  subdivisions,  37  components) 

C.  Action  Relations  (10  subdivisions,  35  components) 


Table  3:  Potentiation  Relationships 

A.  Potentiation  Aspects  ("  subdivisions,  32  components) 

B.  Potentiation  Relations  (3  subdivisions,  11  components) 


Table  1:  Consequation  Context 

A.  Referent  (3  subdivisions,  (■>  components) 

B.  Receiver  (2  subdivisions) 


Note:  Many  components  have  sub-components. 


A  further  task  would  be  to  depict,  to  the  extent  research  provides  evi¬ 
dence,  how  the  many  valuables  and  their  states  arc  interrelated.  That  would 
be  a  major  project  in  itself. 


DI-TliCTION  STUDY 


This  exploratory  study  investigated  how  various  aspects  of  consequat ion/ 
information  bcedback  affected  performance  in  a  detection  task.  The  task  con¬ 
sisted  of  finding  on  one  sheet  of  random  numbers  differences  (changes J  from 
the  numbers  on  another  sheet.  The  numbers  were  printed  in  ten  rows  on  a 
sheet.  Feedback  was  varied  in  two  ways:  (1)  An  indication  that  the  subject 
failed  to  detect  a  difference  within  a  string  of  numerals,  shown  immediatelv 
to  the  subject  reading  that  string.  The  subject  received  or  did  not  receive 
such  feedback.  (2)  A  comment  at  the  end  of  the  row  that  appeared  if  the  sub¬ 
ject  had  not  detected  a  difference.  There  was  (a)  no  comment,  (hi  the  comment 
"Miss,"  or  the  comment  "Idiot."  The  word  "Miss”  was  presumed  to  be  more  or 
less  neutral  hedonically,  whereas  "Idiot"  had  been  rated  by  a  number  of  other 
subjects  earlier  as  one  that  would  have  made  them  extremely  unhappy  if  an 
associate  had  used  it  to  characterize  a  job  they  had  done. 

Procedures 


Indications  that  a  subject  had  failed  to  detect  a  difference  consisted 
of  filled  rectangles  that  appeared  under  a  numeral  string  if  the  subject  ran 
a  special  marker  device  under  the  string.  The  subject  was  instructed  before¬ 
hand  to  refrain  from  running  the  marker  under  a  string  in  which  he/she  saw  a 
diftcrcnce.  Thus,  the  filled  rectangle  would  appear  only  if  the  subiect  failed 
to  detect  a  difference,  and  the  subject  would  get  immediate  feedback  to  that 
effect.  There  was  no  feedback  for  false  positives,  where  the  subject  re¬ 
frained  from  underlining  a  string  where  there  was  no  difference,  but  these 
could  he  counted  as  well  as  the  misses. 

The  subject  was  instructed  that  if  a  rectangle  did  appear,  he 'she  should 
continue  moving  the  marker  from  the  end  of  the  row  to  the  edge  of  the  paper. 

In  the  condition  of  no  comment  feedback,  no  word  appeared.  In  the  condition 
of  comment  feedback,  either  "Miss"  or  "Idiot"  appeared,  depending  on  the 
perimental  condition.  Neither  of  these,  however,  would  appear  unless  the  sub 
ject  had  failed  to  make  a  detection. 

The  numeral  sheets  on  which  the  subject  used  the  marker  were  prem.irktd 
and  preprinted  with  a  chemical  (A.B.  Dick  latent  image  process  1  that  made  the 

rectangles  and  words  invisible  unless  the  marker  passed  over  them. 

Subjects  were  told  initially  that  the  experiment  was  c-r-em  i.il  b  in  in 
vest  i  gat  ion  of  pattern  perception.  The  right-hand  sheet  that  t  »u  •  iil"o  t 
marked  had  different  spaeings  between  strings  of  numerals  fro::  t  be  1>  ft  niui 
page  with  which  the  subject  compared  it.  \s  the  subject  w a  ■;  informed,  *  he 
spaeings  differed  in  the  number  of  spaces  and  In  presence  or  ah  n  t'c»  .>•  a  pa,. 
In  addition,  again  as  the  subject  was  told,  the  order  two  .inn  i  •  h 
reversed.  The  reason  for  the  instruction  about  pat  t  >  m  -  eiwept  ,en  w  <  t  ii 
ducc  the  subject  to  scan  a  line  fairly  rapid  1  \  as  t  hot;  b  r<  td :  a  ,  1  ■ 
print,  and  thus  to  minimize  the  importance  of  *U  t  «vt  i  n  >! :  f :  i  :  n  n 
numerals  themselves.  An  invisible  filled  circle  w.i-.  or.  rl  «•  :  .  i .  -v-a, 

location  where  there  was  a  pattern  difference  between  t  !.■  e,  t  ■ 

and  the  comparison  { left  -  hand  1  page.  I  lie  sipvn.t  .or  t  old  '  ran  t  i  •  .  ;T  i 
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Subsequcnt  to  ten  one-hour  preliminary  sessions  involving  ten  participants 
(sec  below),  two  participants  took  part  in  this  study,  ami  a  third  participation 
was  planned.  For  each  participant,  there  were  nine  one-hour  sessions,  in  one 
case  all  on  the  same  day  with  approximately  ten  minute-  between  sessions,  in 
the  other  case  with  intervals  between  sessions  ran;.;  i  nr.  from  a  half  dax  to 
several  days.  The  data  had  not  been  processed  at  the  time  of  writ  ini',  this 
report.  There  wore  six  experimental  conditions,  m  the  order  MK'IM  I  !  I»G  ,  as 
described  in  Table  1,  with  participants  encountering  seven  to  nine  pa i red 
pages  of  random  nuimr.il  per  session.  flu  .  were  timed  land  told  so)  "to  see 
how  fast  you  can  go  through,  each  page."  starting  each  page  when  the  invest  i  .'.it  or 
said  "Go”  and  announcing  "I  ini  shed"  upon  conn  let  ion,  then  proceed  in.  to  tin.  in  \t  . 
They  were  also  instructed  to  retrain  from  going  Gael  ward  wit:. m  a  row  or  to  an 
earlier  row.  They  were  not  told  about  an>  pattern  or  numeral  ;  t «, f«  edback 
until  just  before  the  session  in  which  that  feedback  ■  i.airrnl,  uni  the.  wen 
not  told  in  adv  UK  e  .ibeiilt  tin  end-of-row  word  tVedn.ie  1  \t  Tin  end  ol  t  hi 
session  t  he  >  re-*  lied  to  six  queries  lbout  the  expenrent  ,  ltu  li.dtng  t  hi  i  r 
reactions  to  the  different  kind-,  of  feedlsu  1  ,  winch  t  hi.  .  rited  -n  ..  I  :.nod.»  1 

9-point  scale  ext  end  i  n-.,  from  a  -taterient  t  hat  t  he  ft  t  d‘  .n  k  made  the  put  n  leant 

"extremely  unhapjp.  ,"  through  neutral,  to  "extreme  1.  happy." 

Slat  er  i  a  1  < 
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Table  4.  Detection  stud)  Sessions  .mil  condition- 


Session  1 : 


rI  pages.  Condition  A:  pattern  recognition  only  and 
back.  Conhm.it  K'n:  a  through  i  in  that  or  lor.  No  ft 
on  Changed  j  age:; .  hanie  instructions. 
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and  three  of  one  space  between  strings  instead  of  two.  (In  the  unchanged 
sheets,  two-thirds  of  the  spneings  between  strings  were  single  spaces,  one- 
third  were  double  spaces).  There  were  two  instances  on  each  sheet  of  t  lie 
following:  leaving  a  space  and  creating  two  strings;  insertion  of  a  space 

within  a  string,  creating  two  strings;  omission  of  spacing  between  strings, 
creating  one  string  instead  of  two;  and  inserting  a  numeral  between  strings, 
creating  one  string  instead  of  two.  Pattern  differences  were  allocated  ran¬ 
domly  in  the  same  fashion  as  numeral  differences.  There  were  14  per  page. 

The  production  process  consisted  of  marking  off  rows  in  a  table  of  random 
numbers  into  numeral  strings  of  the  various  lengths.  String  lengths  were  al¬ 
located  by  rotating  the  ten  string  lengths  (four  to  13  numerals)  through  the 
approximately  100  string  positions  on  a  sheet  with  a  random  number  table.  As 
noted,  there  were  four  different  string  arrangements,  each  with  two  random 
numeral  selections.  The  eight  divided-up  random  number  tables  were  typed  info 
a  word  processor.  The  numeral  and  pattern  differences  were  then  put  into  the 
word  processor,  after  it  first  produced  10-row  original  sheets  without  them, 
to  create  the  difference  sheets.  The  word  processor  then  produced  two  addi¬ 
tional  row  arrangements  (complete  vertical  reversals  and  reversals  within 
pairs  of  rows)  for  'noth  the  original  and  difference  conditions.  Thanks  to 
the  word  processor,  retyping  of  entire  pages  was  not  required  for  either  the 
two  row  conditions  involving  reversals  or  for  the  insertion  of  pattern  and 
numera 1  d i f fercnces . 

Pre linn  nary  Sessions 

Due  to  the  novelty  of  the  technique  of  detecting  differences  among  numbers 
and  patterns  in  comparing  two  arrays  of  random  numbers,  it  was  necessary  to 
conduct  some  one-hour  preliminary  sessions  with  ten  participants  (working 
women  in  a  suburban  t cchn iea ! / i ndust r i a  1  park),  to  gain  familiarity  with  the 
technique.  Cneerta int  ics  included:  (1)  what  proport i on s  of  misses  (failed 
detection-')  might  be  expected;  iJl  how  varying  densities  of  differences 
(percentages  o.’  the  numbers  m  an  array )  might  affect  the  miss  rate;  (.V) 
whether  M'me  kind-  of  differences  would  result  in  more  or  fewer  misses  than 
others,  or  perhap-  virtually  none.  (!)  how  long  it  would  take  participants  to 
compare  two  arrays,  and  in!  what  would  happen  it’  time  limits  were  placed  on 
t  he  comp. i  r  l  i  on s . 
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The  results  showed  an  overall  median  percentage  of  misses  (with  no  time 
limit)  at  30,  indicating  that  this  technique  could  produce  a  substantial  number 
of  misses;  the  medians  across  participants  for  tlie  three  densities  were  30  per¬ 
cent,  40  percent,  and  20  percent.  The  median  among  participants  (with  no  tune 
limits)  was  31,  with  a  range  from  10  to  40  percent;  part ic ipant /dens i t y  per¬ 
centages  ranged  from  10  t  03  percent.  There  were  indications  that  densities 
of  20  and  45  resulted  in  somewhat  higher  miss  percentages  than  t ho  density  of 
10  differences  per  page,  hut  there  was  no  evidence  that  percentages  increased 
with  density  above  20.  The  imposition  of  a  time  limit  seemed  to  make  little 
difference  in  the  total  of  misses  or  of  false  positives,  when  i  ncorn|  1  rt  i .  >n  • 
were  not  counted  as  misses.  Time  limits  that  gave  30  percent  less  time  to 
complete  a  page  resulted  in  many  incompletions,  and  the  same  could  occur  with 
smaller  reductions.  It  was  apparent  that  imposing  a  time  limit  in  this  task 
could  be  a  tricky  operation  if  it  was  desirable  to  avoid  i ncomp 1 et i on s  ,  because 
judging  the  appropriate  limit  would  be  difficult.  Misses  were  found  to  be 
distributed  among  types  of  differences  within  a  range  of  10  to  23  percent 
(except  for  two  of  the  spacing  differences,  which  were  missed  much  more  fre¬ 
quently).  However,  there  was  some  variation  among  participants.  Ml  in  all, 
though  these  outcomes  were  based  simply  on  inspection  of  the  data  and  should 
not  be  viewed  as  conclusive,  they  aided  the  investigator  in  developing  the 
procedures  for  the  study,  described  earlier,  that  would  follow.  -\s  in  all  pilo 
work,  they  also  produced  some  cautions:  (1)  Don't  let  participants  point  at 
changed  strings  with  a  pencil  or  a  finger  in  a  numbcr-hy-nuniher  scrutiny,  and 
(2)  don't  select  as  participants  accounting  or  purchasing  personnel  who  work 
with  numbers  all  day  long,  every  working  day. 

Background 


In  human  information  processing,  a  frequent  and  important  task  is  the 
detection  of  mistakes  in  alphanumeric  materials.  This  task  occurs  in  the 
self-detection  of  material  being  typed  by  secretaries  and  stenographers  in 
transcribing  from  shorthand,  longhand,  or  other  typed  cony,  or  being  typed 
by  the  authors  of  the  material  themselves;  and  it  occurs  in  post -typing 
proofreading  In  clerical  personnel,  authors,  printers,  and  profess ionn 1 
proofreader-  for  publishers.  In  naval  and  other  milit.ir>  operations,  failure 
to  detect  errors  commit  ted  in  preparing  coded  or  textual  mat  or  in  1  could  result 


111 

some 

m  i  i  > 

fort une  . 

Bourne 

as 

h  i  gh 

23  fierce 

nt  m  a 

lor 

t  h 

i s  st  inly 

,  it  was 

det 

ec  t  i on  . 

I  he  re  h 

ave  been 

of 

no  t  m; 

,  << 

la-.--  and 

s  1  lice  r 

error-,  t 

iu  ■ 

e  l  I) \  e  t 

1  gat  or  ■ 

from  1  he 

vlr 

at  it  and 

!  i  ft  .  ■ ; 

(1<) 

T  1  ,'  1 

)  ul  so  l, 

sed  snv 

and 

part 

1  k  ! 

pant  in 

in.  -t  ’’.er 

work  d  i 

v-  IV 

p  itn  t 

'  i  t  1.1',  '' 

ab- 

t  nn  ; 

t‘.l 

'  Ii  ion  .  ! 

it’.  1  '1 

of 

mat  i  i: 

1  n , 

■ip,’.  T  : 

nd  !  s.i  r 

exp 

ii  i  i 

]'*  \ 

!  rn’.. 

in  irt 

t  o 

det  i  i 

*  v 

On  1 

v  I 

t*  i  >  »  >  l  ’  V 

.  -  .  •  t  ■  ] 

to 

the  1 

!•“ 

U't  1  ’>  .  • 

!  1  i t "1 

ud  i  inert  <  d  ir.t  •  -even  t  vvd  "use-  Iran  c  r  :  In  d 
•  r ■  it  V’ler  ;  c.uf  c  >  t  :  'n  I ,»!*.«.*  I  icon  ,  i*c  i  r.  ■  1  e  i  n 

is  mu.  ;  tiTia  : : '  l  -.o  .  .  tote-;  iii'i  -r  i'1'iu't  ■  , 
r. .  ■  i  -  i  t  od  .  (Moran  ,  loci’,  1  1  1  ~  ”  !  h  id.  to  d  i  t  c  c  t 
i  r,  or  i  1  ;•  i  a.  -  i  n  t  .  i !  1 1  >n  i  :\!  text  .  In  .  non 


I'!.;  . i  . >ncr 


i:  id  t  hi 


••  .d  *  lie  i 


-32- 


very  sparse;  an  inquiry  that  included  the  local  chapter  of  the  International 
Typographical  Union  and  several  professional  editors  for  organizations,  as 
well  as  the  Government  Printing  Office,  and  1JSDA  Graduate  School  (both  of 
which  give  courses  in  proofreading),  turned  un  only  a  rather  superficial  text 
by  Lasky  (1954) .  Thompson  et  al.  (1979)  reported  a  study  pertinent  to  the 
present  one  in  which  a  typist  proofread  her  own  copy  for  errors  and  recorded 
the  errors  she  found  and  whether  the  letters  in  which  errors  occurred  were  re¬ 
turned  to  her  for  retyping.  This  post-typing  scl f-mon i tor i ng  resulted  in  a 
considerable  decrease  of  omission  and  substitution  (but  not  grammatical)  errors 
and  of  pages  returned  to  be  retyped. 


The  most  significant  psychological  study  found  in  the  literature  was  by 
Crosland  (1921),  which  was  discovered  after  the  preliminary  worl  in  this  st  ud\ 
was  completed.  Gros land  inserted  into  textual  material  from  several  types  of 
periodicals  M  kinds  of  typographical  errors,  including  mi s«pe  J ] i ngs ,  punctu¬ 
ation  errors,  word  omissions,  and  wrong  type  faces,  with  a  density  of  about 
100  or  SO  per  page.  His  SO  participants,  in  five  groups  of  differing  experience 
with  printed  materials,  read  and  marked  the  errors  they  found.  The  overall 
percentage  of  misses  for  three  different  kinds  of  materials  with  the  higher 
density  were  25.’,  32.’,  and  35.6,  and  the  percentage  for  the  lower  densit} 
was  45.4.  Individual  differences  were  extensive.  The  investigator's  voluminous 
report  contains  analyses  of  the  effects  of  different  kinds  of  typographical 
errors,  different  kinds  of  readers,  rereading,  and  practice. 

The  present  study  was  based  on  random  numbers  instead  of  textual  material 
because  it  had  a  w i t h i n- sub j ect  experimental  design  and  it  was  necessary  to 
obviate  the  learning  effects  that  would  presumably  occur  if  participants  en¬ 
countered  the  same  text  repeatedly,  with  feedback  about  misses.  It  also  seemed 
desirable  to  exclude  typographical  errors  that  might  depend  on  an  individual's 
educational  level  and  that  might  be  subject  to  "learning"  during  the  study, 
such  as  spelling  and  grammatical  errors  due  to  ignorance.  Certain  other  types, 
such  as  type  font  and  type  errat  ieitv,  were  also  excluded  a--  infeasible  with 
typed  copy,  hnphasis  was  placed  on  errors  that  a  part i c i pant  -  reader  would 
readily  acknowledge  as  errors  it  they  were  pointed  out  to  h»  r - -because  a 
literate  reader  would  assuredly  have  learned  the  correct  version  early  in 
life.  To  get  a  handle  on  these,  various  sources  were  examined,  including 
the  errors  Glass  and  Singer  had  introduced  into  the  .lane  Picohs  text.  Odd  1  >  , 
a  valuable  source  was  the  2~th  annual  report  of  t  lu-  hrgonoroic s  Research 
Society  (197(>),  which  contained  IOC  t  >  pograph  i  ca  1  errors, 

Analyses  ot  t  lie  sources  showed  that  the  t  vpes  of  mi::u  r.i  I  errors  incut  i  oned 
carper  in  t  lie  description  of  materials  n>  the  present  tinn  iv're-ented  tin 
most  numerous  letter  error-  to  S-  found  i  n  textual  mit.riil.  !  m  example, 
omissic.n  or  insertion  ot'  a  numeral  in  a  striae  of  Hirer  1 1  ••  c  re'ren  rt 

the  omission  or  insert  inn  ot  i  letter  in  a  word .  tr:m-.iv -at  ■<*'  tun  nnia  r.i  1  - 
could  represent  t ran-pos i t  i an  ot  letters,  and  repetition  at  i  r.-er.il  rent  tit  ion 
of  a  letter.  Pattern  d  i  ( t'erenc  ■■  in  tin  irra  ■  oi  number  ..are  ,!o\  cloned  a 
analog'-  ot  textual  pa  t  t  c  rn  d  i  t  !  e  rein  e  imnh  ;  n  si.  m  .  -nav*  ait  i.n,  aid. 

word  re  \  e  r  --  1 1  s  ,  Part  i  .•  i  part  •  were  a  k .  d  to  dot  ret  er  !  >.  it  •  t  v,  i  ft'cia  t 

1  n  I  !  I  a  1  !  \  hec  111  I  raid,!-,  a  f  text  t  <  r,d  t  ■  ■  :a  id  war!  .  ;  :  •-  !  'a  l  t  "a 

t  he  c  omporterf  let  !  .  r  .  !  bat  i  an  M”  i :  I  .•  ”  .  i  r  : . r  •  .lie  Tad.’,  t 

comp  alien  t  er  mi  1 1  w  IS  booed  t  I-  it  ln.lt  i  I  t  ;  ,  .  a  T  1  ’  T  "d,  a .  ’  T  , 

overlook  in  i -a  r  i  1  ■  nor-  ■  :  t  !  u  u  1 1  t  -  n  t  i .  -  t  w  .  t  :  :  t  •  ■  a  T  '<■  ti  n 

ot  numeral,  rat  in  r  than  ni.io  wia.d  mi:-  r  a  ! 
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CODING  STUDY 


The  purpose  of  this  study  was  to  find  out  how  various  kinds  of  verbally 
aversive  feedback  ( consequat ion )  would  affect  accuracy  performance  in  a  cod¬ 
ing  task.  That  task  consisted  of  transformations  between  two  alphanumeric 
codes,  including  both  encoding  and  decoding,  without  elaboration  or  reduction 
and  independent  of  categorization.  The  codes  consisted  of  randomly  selected 
numbers  and  letters,  a  tactic  tnat  has  been  favored  before  (Anderson,  1980). 
Since  such  codes  are  already  we] 1  - 1  earned  in  literate  participants,  it  was 
assumed  that  feedback  would  affect  performance  only  and  not  also  involve 
learning.  As  in  other  studies  in  this  project,  the  random  selection  of  dif¬ 
ferent  but  equivalent  code  components  would  make  it  possible  to  generate  in¬ 
put  materials  that  the  same  participant  could  repeatedly  encounter  without 
memorizing  particular  inputs  that  would  make  subsequent  performance  easier. 

But  as  in  the  other  studies,  it  had  not  been  foreseen  how  difficult  it 
would  be  to  generate  and  present  information  processing  problems  to  partici¬ 
pants  so  they  would  continue  to  make  enough  errors  to  reflect  differential 
effects  from  the  states  of  an  independent  variable  involving  consequat ion. 

As  a  result,  all  nine  three-hour  sessions  that  were  conducted,  with  eight 
participants,  were  devoted  to  trying  to  find  a  task  that  would  occasion 
enough  errors,  and  the  study  never  reached  the  point  of  introducing  the  con¬ 
sequat  ion  variable.  However,  the  process  of  seeking  a  viable  method  to  as¬ 
sure  participant  errors  has  some  interest  in  itself. 

The  process  took  place  in  two  parts.  In  each,  the  codes  were  the  eight 
numbers  2-9  and  24  letters  (A  and  2  being  omitted),  the  same  eight  letters 
occurring  in  each  problem.  In  the  first,  with  seven  sessions  and  as  many 
participants,  a  participant  encountered  four  pairs  of  rows,  one  row  of  numer¬ 
als  and  one  row  of  letters,  alternating  in  vertical  order.  The  participant 
started  with  one  of  the  numbers  in  the  top  row,  identified  the  letter  below 
it,  found  this  letter  in  another  row  of  eight  letters,  selected  the  numeral 
in  the  row  under  it,  found  that  number  in  another  row  of  eight  numerals, 
selected  the  letter  under  it,  found  that  letter  in  another  row  of  letters, 
and  selected  the  numeral  under  it.  That  was  the  correct  answer.  To  register 
it,  the  participant  found  and  marked  it  in  an  8  x  8  matrix  of  numerals  on  the 
answer  sheet.  The  participant  did  this  for  all  eight  top-row  numerals.  In 
these  transformations  (encoding  and  decoding),  the  participant  relied  not  on 
memory  of  intercode  translations  but  on  similar  positions  in  a  row,  which 
kept  changing.  Thus,  the  alphanumeric  transformation  involved  position  cod¬ 
ing,  that  is,  sequential  positions.  Very  few  errors  resulted.  Speeding  up 
the  process  by  giving  small  amounts  of  money  as  positive  reinforcers  for  go¬ 
ing  faster  or  by  taking  money  away  for  not  going  fast  enough  still  failed  to 
produce  many  errors.  Participants  reported  they  were  trying  to  be  accurate, 
though  they  had  received  no  instructions  to  this  effect. 

A  number  of  alternative  schemes  were  tried  out.  (11  There  were  trans¬ 
formations  from  numeral  to  numeral  and  from  letter  to  letter.  That  i s ,  in 
the  former  com-  a  participant  would  select  a  number  below  a  number,  find  the 
corresponding  number  in  another  row,  select  the  number  below  that  .  and  so  on. 
(2)  Rows  ot  numerals  and  letters  were  displaced  so  corresponding  positions 
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in  rows  (the  coding  transformation)  were  not  vertically  in  line.  (3)  Let¬ 
ters  and  numerals  were  arranged  in  columns  rather  than  rows.  (4)  New  pairs 
of  rows  were  placed  on  successive  pages  of  a  booklet  rather  than  on  the  same 
sheet.  (5)  The  participant  had  to  make  transformations  between  three-number 
units  and  three- letter  units.  (6)  Participants  had  to  proceed  from  right  to 
left  instead  of  left  to  right  in  selecting  numerals  initially.  (7)  A  clock 
code  replaced  the  24- letter  code.  None  of  these  various  arrangements  raised 
the  error  rate  from  0-5Dij. 

(8)  An  arithmetic  operation  was  introduced.  Each  time  the  participant 
made  a  transformation,  he/ she  had  to  add  or  subtract  one  number  or  one 
letter  to  get  the  next  higher  or  lower  one  in  sequence  for  the  next  transfor¬ 
mation.  (9)  The  total  number  of  transformations  was  doubled  or  tripled  by  re¬ 
quiring  the  participant  to  recycle  through  the  material  for  each  of  the  eight 
starting  numerals  in  each  problem.  (10)  Double  recycling  in  combination  with 
the  arithmetic  operation  generated  the  largest  proportion  of  errors  but  these 
included  errors  due  to  the  participant's  forgetting  where  he/she  was  in  the 
recycling  process,  a  type  of  error  different  from  simply  making  a  transforma¬ 
tion  error.  In  addition,  this  procedure  required  five  minutes  per  problem 
of  eight  trials.  (11)  A  preferable  combination  was  a  single  recycle  with  the 
arithmetic  operation  and  some  monetary  reinforcement  to  go  fast.  This  pro¬ 
duced  5-15°0  error.  However,  each  problem  of  eight  trials  required  three 
minutes . 

It  was  estimated  that  instead  of  the  total  time  originally  set  aside 
for  testing  the  feedback  variable,  at  least  twice  as  much  time  would  be  needed 
per  participant  (and  for  the  coding  task)  with  either  a  larger  number  of 
shorter  problems/trials  yielding  the  lower  error  rates  or  with  about  the  same 
number  of  problems/trials  with  each  taking  longer  and  yielding  the  higher 
error  rates.  An  alternative  would  be  to  reduce  the  number  of  participants, 
running  each  of  four  or  five  (instead  of  ten)  in  a  time-series  multiple  base¬ 
line  design  divided  between  the  two  arrangements. 

In  the  two  sessions  of  the  second  part  of  the  exploration  process,  the 
single  participant  encountered  (>1  pairs  of  numerals  and  letters  in  blocks  of 
eight  rows  and  eight  columns.  In  a  random  fashion,  in  half  the  pairs  the 
numeral  was  first,  in  the  other  half  the  letter.  No  numeral  or  letter  was 
repeated  within  a  row.  It  was  thought  this  arrangement  might  lead  to  more 
processing  errors.  The  participant  followed  these  instructions;  the  inves¬ 
tigator  wrote  down  the  participant's  responses; 

When  I  say  "Go,"  call  out  the  first  numeral  in  the  top  row.  Immedi¬ 
ately  next  to  it  is  a  letter.  Find  that  letter  in  the  second  row. 
Immediately  next  to  it  is  a  numeral.  Find  that  numeral  in  the  third 
row.  Immediately  next  to  it  is  a  letter.  Find  that  letter  in  the 
next  row.  Proceed  in  this  way  until  you  come  to  the  bottom  row.  Gall 
out  the  numeral  or  letter  you  end  up  with.  That  letter,  or  the  letter 
next  to  the  numeral  you  called  out,  is  the  one  that  corresponds  to  the 
numeral  you  began  with. 

Go  through  this  procedure  as  fast  as  you  can.  As  soon  as  you  have 
called  out  the  bottom  numeral  or  letter,  start  the  procedure  again 
with  the  second  numeral  in  the  top  row.  Repeat  until  you  have 
started  with  all  eight  numerals  in  the  top  row.  If  you  start  with 


the  wrong  numeral.  I'll  correct  you.  You  will  be  timed  for  each  time 

you  go  through  the  rows.  Once  you've  proceeded  from  one  row  to  the 

next,  don't  go  back,  even  if  you  think  you  got  mixed  up. 

The  participant,  who  was  not  permitted  to  write  anything  down,  traversed 
each  block  eight  times,  and  completed  eight  blocks  (64  responses)  without  mak¬ 
ing  a  mistake,  except  in  the  first  two;  the  time  to  complete  a  block  dropped 
by  25  percent.  The  participant  also  made  transformations  starting  with  a 
letter  in  the  top  row  and  ending  with  a  numeral  in  the  bottom  row,  with  only 
one  error  in  four  blocks  (52  responses).  In  one  numeral -first  and  one  letter- 
first  block  there  was  a  space  between  numeral  and  letter  within  a  pair  and 
two  spaces  instead  of  one  between  pairs  in  a  row  and  between  rows.  This  spac¬ 
ing  had  no  apparent  effect  on  error  rate  or  time. 

In  a  subsequent  session  the  task  was  made  more  complex.  The  procedure 
was  similar,  but  the  participant  had  to  start  with  the  two  numerals  in  the 
first  two  numeral- letter  pairs,  find  in  the  second  row  the  two  corresponding 
letters,  paired  there  with  different  numerals,  find  those  numerals  in  the  next 
row,  paired  there  with  different  letters,  and  so  on  to  the  bottom  row  of  the 
block.  Numeral- letter  order  still  varied  within  pairs.  Kith  this  task  the 
participant  began  to  produce  a  few  errors  and  took  almost  three  times  longer 
to  traverse  a  block.  The  increased  load  on  short-term  memory,  combined  with 
the  different  numeral- letter  orders,  presumably  was  the  cause. 

To  test  the  limits,  the  participant  was  instructed  to  follow  the  same  pro¬ 
cedure  but  with  three  numerals  at  a  time,  then  three  letters,  then  three 
numerals,  and  so  forth.  The  participant  made  more  errors,  got  "lost"  twice 
and  took  twice  as  long  as  before. 

In  a  further  variation  of  procedure,  the  participant  went  back  to  single 
numerals  and  letters  at  a  time  within  a  block  but  had  to  work  with  two  blocks 
at  a  time.  Parallel  transformations  had  to  be  made  in  each  block  (and,  as  in 
all  the  other  transformations,  held  in  memory  storage)  before  proceeding  to  the 
next.  In  one  traversal  the  subject  got  lost  three  times  and  made  six  errors 
otherwise  (out  of  16  responses),  taking  about  as  long  as  in  the  three-numeral, 
three- letter  procedure. 

A  final  procedure  consisted  of  combining  the  two-numeral,  two-letter  re¬ 
quirement  with  the  two-block  procedure.  The  participant  tried  one  traversal, 
got  lost,  and  said  "1  don't  think  1  can  do  this." 

Although  this  account  can  provide  no  conclusions,  being  based  on  one 
participant  and  a  small  number  of  trials,  it  may  point  the  way  toward  a  coding 
task  with  numeral  and  letter  codes  that  would  be  difficult  enough  to  generate 
errors  for  a  test  of  conscquation  effects. 
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